Speaker Identification by Short Phrases Using Image Processing Procedure
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Kazanskii Gosudarstvennyi Universitet. Uchenye Zapiski. Seriya Fiziko-Matematichaskie Nauki, Vol. 150 (2008), no. 1, pp. 107-114. This article was harvested from the source Math-Net.Ru.

A new algorithm for speaker identification is proposed. A set of sound files belonging to two speakers is given. Two short files are known to correspond to two different speakers. The speakers are not assumed to have uttered the same phrase. The task is to determine which speaker each file in the set belongs to. The problem is solved by approximating the spectral surface of a sound file with wavelets of a special kind. The compressed form of the spectral surface is processed by a neural network, and the decision about the speaker is based on the values the network produces. Results of an experiment with files from a speech database are presented.
Keywords: speaker distinguishing, short phrases, graphical method.
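
The pipeline outlined in the abstract (spectral surface, wavelet compression, neural network, decision from the network outputs) can be illustrated with a minimal sketch. Everything concrete here is an assumption and not the author's method: the abstract does not specify the wavelets "of a special kind", so a scaled Haar approximation band is swapped in, a single logistic neuron stands in for the neural network, and the function names, frame sizes and the synthetic test tones are hypothetical.

```python
import numpy as np

def spectral_surface(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram: rows are time frames, columns are frequency bins."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.log1p(np.abs(np.fft.rfft(frames, axis=1)))

def haar_approx(surface, levels=3):
    """Scaled 2-D Haar approximation band: average 2x2 blocks, `levels` times.
    Assumption: stands in for the unspecified wavelets of the paper."""
    a = surface
    for _ in range(levels):
        rows, cols = (a.shape[0] // 2) * 2, (a.shape[1] // 2) * 2
        a = a[:rows, :cols]  # drop an odd trailing row/column
        a = (a[0::2, 0::2] + a[0::2, 1::2] + a[1::2, 0::2] + a[1::2, 1::2]) / 4.0
    return a

def features(signal):
    """Compressed spectral surface; each row is one coarse time slice."""
    return haar_approx(spectral_surface(signal))

def train_neuron(x, y, epochs=500, lr=0.5):
    """Single logistic neuron trained by batch gradient descent (a stand-in
    for the neural network mentioned in the abstract)."""
    w, b = np.zeros(x.shape[1]), 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))
        err = p - y
        w -= lr * (x.T @ err) / len(y)
        b -= lr * err.mean()
    return w, b

def identify(ref_a, ref_b, unknown_files):
    """Train on the two reference files, then label each unknown file by the
    mean output of the neuron over its time slices."""
    fa, fb = features(ref_a), features(ref_b)
    x = np.vstack([fa, fb])
    mu = x.mean(axis=0)  # centre the features; no rescaling
    y = np.concatenate([np.zeros(len(fa)), np.ones(len(fb))])
    w, b = train_neuron(x - mu, y)
    labels = []
    for s in unknown_files:
        p = 1.0 / (1.0 + np.exp(-((features(s) - mu) @ w + b)))
        labels.append("A" if p.mean() < 0.5 else "B")
    return labels

def synthetic_voice(freq_hz, seed, sr=8000, seconds=1.0):
    """Hypothetical test signal: a noisy tone, not real speech."""
    rng = np.random.default_rng(seed)
    t = np.arange(int(sr * seconds)) / sr
    return np.sin(2 * np.pi * freq_hz * t) + 0.1 * rng.standard_normal(t.size)
```

A call such as `identify(synthetic_voice(300, 0), synthetic_voice(1200, 1), [synthetic_voice(300, 2), synthetic_voice(1200, 3)])` returns one label per unknown file. Real speech would need far richer features and a proper network; the sketch only shows how the stages connect.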
@article{UZKU_2008_150_1_a10,
     author = {E. L. Stolov},
     title = {Speaker {Identification} by {Short} {Phrases} {Using} {Image} {Processing} {Procedure}},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {107--114},
     year = {2008},
     volume = {150},
     number = {1},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2008_150_1_a10/}
}
E. L. Stolov. Speaker Identification by Short Phrases Using Image Processing Procedure. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Kazanskii Gosudarstvennyi Universitet. Uchenye Zapiski. Seriya Fiziko-Matematichaskie Nauki, Vol. 150 (2008), no. 1, pp. 107-114. http://geodesic.mathdoc.fr/item/UZKU_2008_150_1_a10/