Mid-level features for audio chord recognition using a deep neural network
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 109-117
Voir la notice du chapitre de livre provenant de la source Math-Net.Ru
Deep neural networks composed of several pre-trained layers have been successfully applied to various tasks related to audio processing. Some configurations of deep neural networks (including deep recurrent networks) which can be pretrained with the help of stacked denoising autoencoders are proposed and examined in this paper in application to feature extraction for audio chord recognition task. The features obtained from an audio spectrogram using such network can be used instead of conventional chroma features to recognize the actual chords in the audio recording. Chord recognition quality that was achieved using the proposed features is compared to the one that was achieved using conventional chroma features which do not rely on any machine learning technique.
Keywords:
audio chord recognition, recurrent network, deep learning.
Mots-clés : autoencoder
Mots-clés : autoencoder
@article{UZKU_2013_155_4_a10,
author = {N. Glazyrin},
title = {Mid-level features for audio chord recognition using a~deep neural network},
journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
pages = {109--117},
publisher = {mathdoc},
volume = {155},
number = {4},
year = {2013},
language = {en},
url = {http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/}
}
TY - JOUR AU - N. Glazyrin TI - Mid-level features for audio chord recognition using a deep neural network JO - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki PY - 2013 SP - 109 EP - 117 VL - 155 IS - 4 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/ LA - en ID - UZKU_2013_155_4_a10 ER -
%0 Journal Article %A N. Glazyrin %T Mid-level features for audio chord recognition using a deep neural network %J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki %D 2013 %P 109-117 %V 155 %N 4 %I mathdoc %U http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/ %G en %F UZKU_2013_155_4_a10
N. Glazyrin. Mid-level features for audio chord recognition using a deep neural network. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 109-117. http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a10/