Mid-level features for audio chord recognition using a deep neural network
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 109-117

Voir la notice du chapitre de livre provenant de la source Math-Net.Ru

Deep neural networks composed of several pre-trained layers have been successfully applied to various tasks related to audio processing. Some configurations of deep neural networks (including deep recurrent networks) which can be pretrained with the help of stacked denoising autoencoders are proposed and examined in this paper in application to feature extraction for audio chord recognition task. The features obtained from an audio spectrogram using such network can be used instead of conventional chroma features to recognize the actual chords in the audio recording. Chord recognition quality that was achieved using the proposed features is compared to the one that was achieved using conventional chroma features which do not rely on any machine learning technique.
Keywords: audio chord recognition, recurrent network, deep learning.
Mots-clés : autoencoder
@article{UZKU_2013_155_4_a10,
     author = {N. Glazyrin},
     title = {Mid-level features for audio chord recognition using a~deep neural network},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {109--117},
     publisher = {mathdoc},
     volume = {155},
     number = {4},
     year = {2013},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/}
}
TY  - JOUR
AU  - N. Glazyrin
TI  - Mid-level features for audio chord recognition using a deep neural network
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2013
SP  - 109
EP  - 117
VL  - 155
IS  - 4
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/
LA  - en
ID  - UZKU_2013_155_4_a10
ER  - 
%0 Journal Article
%A N. Glazyrin
%T Mid-level features for audio chord recognition using a deep neural network
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2013
%P 109-117
%V 155
%N 4
%I mathdoc
%U http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/
%G en
%F UZKU_2013_155_4_a10
N. Glazyrin. Mid-level features for audio chord recognition using a deep neural network. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 109-117. http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/