Repository logo
 
Publication

Self-Similarity Matrices and Localized Attention for Chorus Recognition: A Data-Efficient Music Information Retrieval Approach

datacite.subject.fosEngenharia e Tecnologia::Outras Engenharias e Tecnologias
dc.contributor.advisorMalheiro, Ricardo Manuel da Silva
dc.contributor.authorMena, Jose Daniel Luna
dc.date.accessioned2025-12-22T11:13:44Z
dc.date.available2025-12-22T11:13:44Z
dc.date.issued2025-11-25
dc.description.abstractThis project presents an efficient approach to chorus recognition in English song lyrics that achieves state-of-the-art performance with significantly fewer resources than existing methods. We developed a Bidirectional Long Short-Term Memory (BiLSTM) model with localized attention mechanisms, trained on only 780 songs compared to the 25,000+ songs typically used in Music Information Retrieval research. Our approach addresses class imbalance through comprehensive stabilization techniques and leverages nine feature views capturing structural, semantic, and rhythmic patterns via selfsimilarity matrices. Through systematic experimentation, we demonstrate that chorus detection relies primarily on local contextual patterns rather than global structural awareness, with head self-similarity features (line beginnings) proving most critical for segmentation. The BiLSTM + Attention model achieves 78.2% Macro F1 at the line level, matching Watanabe & Goto's (2020) performance with 100,000+ songs and significantly exceeding Fell et al.'s (2018) 67.4% F1 with 25,000 songs. For boundary detection, the model achieves 59.6% F1 for exact boundaries and 74.7% F1 with ±2 tolerance. The research demonstrates that strategic data curation, comprehensive feature engineering, and targeted optimization can compete effectively with resource-intensive approaches, showing that local pattern recognition outperforms complex global modeling strategies in specialized domains like lyric analysis.eng
dc.identifier.tid204092205
dc.identifier.urihttp://hdl.handle.net/10400.8/15177
dc.language.isoeng
dc.relationMobile Energy Resources in Grids of Electricity
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectLyric segmentation
dc.subjectChorus detection
dc.subjectAttention mechanisms
dc.subjectSelf-similarity matrices
dc.subjectLocal pattern recognition
dc.titleSelf-Similarity Matrices and Localized Attention for Chorus Recognition: A Data-Efficient Music Information Retrieval Approach
dc.typemaster thesis
dspace.entity.typePublication
oaire.awardTitleMobile Energy Resources in Grids of Electricity
oaire.awardURIhttp://hdl.handle.net/10400.8/15100
oaire.fundingStreamEnergy
relation.isProjectOfPublication4ad83ba5-ee4d-452c-8747-e55b7390f0ed
relation.isProjectOfPublication.latestForDiscovery4ad83ba5-ee4d-452c-8747-e55b7390f0ed
thesis.degree.nameMestrado em Ciências de Dados

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Project_2230396_Jose_Mena_MCD_c_f.pdf
Size:
3.15 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.32 KB
Format:
Item-specific license agreed upon to submission
Description: