Automatic transcription of music using deep learning techniques

Gil, André Ferreira

Publicação

Automatic transcription of music using deep learning techniques

2019-05-21Dissertação de mestrado

datacite.subject.fos	Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática	pt_PT
dc.contributor.advisor	Grilo, Carlos Fernando Almeida
dc.contributor.advisor	Domingues, Patrício Rodrigues
dc.contributor.advisor	Reis, Gustavo Miguel Jorge
dc.contributor.author	Gil, André Ferreira
dc.date.accessioned	2019-08-20T13:43:00Z
dc.date.available	2019-08-20T13:43:00Z
dc.date.issued	2019-05-21
dc.description.abstract	Music transcription is the problem of detecting notes that are being played in a musical piece. This is a difficult task that only trained people are capable of doing. Due to its difficulty, there have been a high interest in automate it. However, automatic music transcription encompasses several fields of research such as, digital signal processing, machine learning, music theory and cognition, pitch perception and psychoacoustics. All of this, makes automatic music transcription an hard problem to solve. In this work we present a novel approach of automatically transcribing piano musical pieces using deep learning techniques. We take advantage of deep learning techniques to build several classifiers, each one responsible for detecting only one musical note. In theory, this division of work would enhance the ability of each classifier to transcribe. Apart from that, we also apply two additional stages, pre-processing and post-processing, to improve the efficiency of our system. The pre-processing stage aims at improving the quality of the input data before the classification/transcription stage, while the post-processing aims at fixing errors originated during the classification stage. In the initial steps, preliminary experiments have been performed to fine tune our model, in both three stages: pre-processing, classification and post-processing. The experimental setup, using those optimized techniques and parameters, is shown and a comparison is given with other two state-of-the-art works that apply the same dataset as well as the same deep learning technique but using a different approach. By different approach we mean that a single neural network is used to detect all the musical notes rather than one neural network per each note. Our approach was able to surpass in frame-based metrics these works, while reaching close results in onset-based metrics, demonstrating the feasability of our approach.	pt_PT
dc.identifier.tid	202276716	pt_PT
dc.identifier.uri	http://hdl.handle.net/10400.8/4041
dc.language.iso	eng	pt_PT
dc.subject	Automatic music transcription	pt_PT
dc.subject	Multi-pitch estimation	pt_PT
dc.subject	Digital signal processing	pt_PT
dc.subject	Artificial neural networks	pt_PT
dc.subject	Machine learning and deep learning	pt_PT
dc.title	Automatic transcription of music using deep learning techniques	pt_PT
dc.type	master thesis
dspace.entity.type	Publication
rcaap.rights	openAccess	pt_PT
rcaap.type	masterThesis	pt_PT
thesis.degree.name	Mestrado em Engenharia Informática - Computação Móvel	pt_PT

Ficheiros

Principais

A mostrar 1 - 1 de 1

Nome:: André Ferreira Gil - Report_Model.pdf
Tamanho:: 2.7 MB
Formato:: Adobe Portable Document Format
Descrição:

Ver/Abrir

Licença

A mostrar 1 - 1 de 1

Nome:: license.txt
Tamanho:: 1.32 KB
Formato:: Item-specific license agreed upon to submission
Descrição:

Ver/Abrir

Coleções

ESTG - Mestrado em Engenharia Informática - Computação Móvel