Publication
Self-Similarity Matrices and Localized Attention for Chorus Recognition: A Data-Efficient Music Information Retrieval Approach
| datacite.subject.fos | Engenharia e Tecnologia::Outras Engenharias e Tecnologias | |
| dc.contributor.advisor | Malheiro, Ricardo Manuel da Silva | |
| dc.contributor.author | Mena, Jose Daniel Luna | |
| dc.date.accessioned | 2025-12-22T11:13:44Z | |
| dc.date.available | 2025-12-22T11:13:44Z | |
| dc.date.issued | 2025-11-25 | |
| dc.description.abstract | This project presents an efficient approach to chorus recognition in English song lyrics that achieves state-of-the-art performance with significantly fewer resources than existing methods. We developed a Bidirectional Long Short-Term Memory (BiLSTM) model with localized attention mechanisms, trained on only 780 songs compared to the 25,000+ songs typically used in Music Information Retrieval research. Our approach addresses class imbalance through comprehensive stabilization techniques and leverages nine feature views capturing structural, semantic, and rhythmic patterns via selfsimilarity matrices. Through systematic experimentation, we demonstrate that chorus detection relies primarily on local contextual patterns rather than global structural awareness, with head self-similarity features (line beginnings) proving most critical for segmentation. The BiLSTM + Attention model achieves 78.2% Macro F1 at the line level, matching Watanabe & Goto's (2020) performance with 100,000+ songs and significantly exceeding Fell et al.'s (2018) 67.4% F1 with 25,000 songs. For boundary detection, the model achieves 59.6% F1 for exact boundaries and 74.7% F1 with ±2 tolerance. The research demonstrates that strategic data curation, comprehensive feature engineering, and targeted optimization can compete effectively with resource-intensive approaches, showing that local pattern recognition outperforms complex global modeling strategies in specialized domains like lyric analysis. | eng |
| dc.identifier.tid | 204092205 | |
| dc.identifier.uri | http://hdl.handle.net/10400.8/15177 | |
| dc.language.iso | eng | |
| dc.relation | Mobile Energy Resources in Grids of Electricity | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Lyric segmentation | |
| dc.subject | Chorus detection | |
| dc.subject | Attention mechanisms | |
| dc.subject | Self-similarity matrices | |
| dc.subject | Local pattern recognition | |
| dc.title | Self-Similarity Matrices and Localized Attention for Chorus Recognition: A Data-Efficient Music Information Retrieval Approach | |
| dc.type | master thesis | |
| dspace.entity.type | Publication | |
| oaire.awardTitle | Mobile Energy Resources in Grids of Electricity | |
| oaire.awardURI | http://hdl.handle.net/10400.8/15100 | |
| oaire.fundingStream | Energy | |
| relation.isProjectOfPublication | 4ad83ba5-ee4d-452c-8747-e55b7390f0ed | |
| relation.isProjectOfPublication.latestForDiscovery | 4ad83ba5-ee4d-452c-8747-e55b7390f0ed | |
| thesis.degree.name | Mestrado em Ciências de Dados |
