Classification of speech files by waveforms
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Volume 156 (2014) no. 4, pp. 39-46. This article was harvested from the source Math-Net.Ru.


The paper presents a new approach to classifying speech files by the language spoken in them. Several parameters describing the form of individual signals in a speech file are proposed, such as the positions of quantiles within a selected fragment and the parameters of parabolas approximating the area under the curve. It is shown that the distribution of these parameters can be used to distinguish automatically between files containing Tatar speech and files containing Russian speech.
Keywords: automatic differentiation of languages, form of a file fragment, quantiles, approximation by parabolas.
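
The abstract names two kinds of fragment parameters but does not give their exact definitions, so the following is only a minimal sketch under stated assumptions. It assumes that the "area under the curve" is the normalized running sum of absolute sample values in a fragment, that a quantile position is the relative index at which this running sum first reaches a given level, and that the parabola is fitted to the running sum by ordinary least squares. The function names (`cumulative_area`, `quantile_positions`, `fit_parabola`, `fragment_features`) and the default quantile levels are hypothetical and are not taken from the paper.

```python
from itertools import accumulate


def cumulative_area(fragment):
    """Normalized running sum of |sample|; the last value is exactly 1.0."""
    running = list(accumulate(abs(s) for s in fragment))
    total = running[-1]
    if total == 0:
        raise ValueError("fragment is silent; its area is zero")
    return [r / total for r in running]


def quantile_positions(fragment, levels=(0.25, 0.5, 0.75)):
    """Relative position (0..1] where the cumulative area first reaches each level."""
    area = cumulative_area(fragment)
    n = len(area)
    positions = []
    for q in levels:
        idx = next(i for i, a in enumerate(area) if a >= q)
        positions.append((idx + 1) / n)
    return positions


def fit_parabola(xs, ys):
    """Least-squares coefficients (a, b, c) of y ~ a*x^2 + b*x + c.

    Needs at least three distinct x values.
    """
    # Power sums that make up the 3x3 normal equations.
    s = [sum(x ** k for x in xs) for k in range(5)]
    t = [sum(y * x ** k for x, y in zip(xs, ys)) for k in range(3)]
    m = [
        [s[4], s[3], s[2], t[2]],
        [s[3], s[2], s[1], t[1]],
        [s[2], s[1], s[0], t[0]],
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(3):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return tuple(m[i][3] / m[i][i] for i in range(3))


def fragment_features(fragment, levels=(0.25, 0.5, 0.75)):
    """Quantile positions plus parabola coefficients for one fragment."""
    area = cumulative_area(fragment)
    n = len(area)
    xs = [(i + 1) / n for i in range(n)]
    return quantile_positions(fragment, levels), fit_parabola(xs, area)
```

Under these assumptions, calling `fragment_features` on every fragment of a recording yields a collection of parameter vectors, and it is the distribution of such vectors across a file that the abstract describes as the basis for separating the two languages.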
@article{UZKU_2014_156_4_a4,
     author = {R. Kh. Latypov and R. R. Nigmatullin and E. L. Stolov},
     title = {Classification of speech files by waveforms},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {39--46},
     year = {2014},
     volume = {156},
     number = {4},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2014_156_4_a4/}
}
TY  - JOUR
AU  - R. Kh. Latypov
AU  - R. R. Nigmatullin
AU  - E. L. Stolov
TI  - Classification of speech files by waveforms
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2014
SP  - 39
EP  - 46
VL  - 156
IS  - 4
UR  - http://geodesic.mathdoc.fr/item/UZKU_2014_156_4_a4/
LA  - ru
ID  - UZKU_2014_156_4_a4
ER  - 
%0 Journal Article
%A R. Kh. Latypov
%A R. R. Nigmatullin
%A E. L. Stolov
%T Classification of speech files by waveforms
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2014
%P 39-46
%V 156
%N 4
%U http://geodesic.mathdoc.fr/item/UZKU_2014_156_4_a4/
%G ru
%F UZKU_2014_156_4_a4
R. Kh. Latypov; R. R. Nigmatullin; E. L. Stolov. Classification of speech files by waveforms. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Volume 156 (2014) no. 4, pp. 39-46. http://geodesic.mathdoc.fr/item/UZKU_2014_156_4_a4/

[1] Li H., Ma B., Lee C.-H., “A vector space modeling approach to spoken language identification”, IEEE Trans. Audio, Speech, Language Process., 15:1 (2007), 271–284

[2] Campbell W. M., Campbell J. P., Reynolds D. A., Singer E., Torres-Carrasquillo P. A., “Support vector machines for speaker and language recognition”, Computer Speech & Language, 20:2–3 (2006), 210–229

[3] Siniscalchi S. M., Reed J., Svendsen T., Lee C.-H., “Universal attribute characterization of spoken languages for automatic spoken language recognition”, Computer Speech & Language, 27:1 (2013), 209–227

[4] Koolagudi S. G., Rastogi D., Rao K. S., “Spoken language identification using spectral features”, Commun. Comput. Inform. Sci., 306 (2012), 496–497

[5] Newman J. L., Cox S. J., “Language identification using visual features”, IEEE Trans. Audio, Speech, Language Process., 20:7 (2012), 1936–1947

[6] Diehl R. L., Lotto A. J., Holt L. L., “Speech perception”, Annu. Rev. Psychol., 55 (2004), 149–179

[7] Nigmatullin R. R., Stolov E. L., “Opredelenie vremeni ustanovleniya vokalizatsii v slogakh, nachinayuschikhsya s glukhoi soglasnoi”, Vestn. KGTU im. A. N. Tupoleva, 2011, no. 1, 159–163

[8] Nigmatullin R. R., Stolov E. L., “Podkhod k zadache avtomaticheskogo opredeleniya yazyka v rechevom faile”, Trudy seminara “Metody modelirovaniya”, 5, ed. V. A. Raikhlin, AN RT, Kazan, 2013, 157–163

[9] Nigmatullin R. R., Stolov E. L., “Parametry, kharakterizuyuschie lokalnye fragmenty rechevykh failov”, Uchen. zap. Kazan. un-ta. Ser. Fiz.-matem. nauki, 155:2 (2013), 100–107

[10] Van der Varden B. L., Matematicheskaya statistika, Izd-vo inostr. lit., M., 1960, 435 pp.