Detection of pauses between word fragments of speech recordings
Informacionnye tehnologii i vyčislitelnye sistemy, no. 1 (2022), pp. 40-46.

See the article record from the source Math-Net.Ru

The paper considers the problem of dividing recordings of speech signals into segments produced while speech is present (word segments) and the pauses between them. This segmentation is an important stage in identifying speech components from their features. Segments of the signal recorded during pauses are assumed to be samples from a stationary random sequence (pause noise). As the main characteristic of pause noise, the authors propose using estimates, obtained from a training sample, of the mathematical expectations of the portions of energy that noise segments of a fixed finite duration have in predetermined frequency bands (subband analysis). It is shown that using the maximum ratio of the energy portions of the currently analyzed segment to the corresponding mathematical expectations for noise segments takes the possible presence of a speech component into account to the greatest extent. This is equivalent to maximizing the signal-to-noise ratio, so the proposed decision function is optimal in that sense.
Keywords: segmentation of speech recordings, subband analysis, optimal decision function.
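
The decision rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: band energies are approximated with an FFT split into equal-width bands (the paper's own subband-analysis apparatus is not given in this record), the maximum is taken over the frequency bands, and the function names, the band count, and the threshold value are hypothetical choices made for illustration only.

```python
import numpy as np


def band_energies(segment, n_bands):
    """Energy of one segment in each of n_bands equal-width frequency bands.

    Assumption: an FFT power spectrum stands in for the paper's subband analysis.
    """
    power = np.abs(np.fft.rfft(segment)) ** 2
    return np.array([band.sum() for band in np.array_split(power, n_bands)])


def train_noise_profile(noise_segments, n_bands):
    """Sample mean of the per-band energies over training pause (noise) segments."""
    return np.mean([band_energies(s, n_bands) for s in noise_segments], axis=0)


def decision_statistic(segment, noise_profile):
    """Largest ratio of a band energy of the segment to the expected noise energy there."""
    energies = band_energies(segment, len(noise_profile))
    return float(np.max(energies / noise_profile))


def is_pause(segment, noise_profile, threshold):
    """Label a segment as a pause when the statistic stays at or below the threshold.

    The threshold is a hypothetical tuning parameter, not a value from the paper.
    """
    return decision_statistic(segment, noise_profile) <= threshold


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_samples, n_bands = 256, 8

    # Training sample: synthetic stationary noise standing in for recorded pauses.
    training = rng.normal(0.0, 0.1, size=(200, n_samples))
    profile = train_noise_profile(training, n_bands)

    pause = rng.normal(0.0, 0.1, size=n_samples)
    tone = np.sin(2 * np.pi * 0.1 * np.arange(n_samples))  # crude stand-in for speech
    for name, seg in [("pause", pause), ("speech-like", pause + tone)]:
        print(name, decision_statistic(seg, profile), is_pause(seg, profile, 4.0))
```

Taking the maximum over bands means that a speech component concentrated in a single band is enough to raise the statistic, even when the total energy of the segment barely exceeds the noise level.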
@article{ITVS_2022_1_a4,
     author = {E. G. Zhilyakov and S. P. Belov and A. S. Belov and A. A. Medvedeva},
     title = {Detection of pauses between word fragments of speech recordings},
     journal = {Informacionnye tehnologii i vy\v{c}islitelnye sistemy},
     pages = {40--46},
     publisher = {mathdoc},
     number = {1},
     year = {2022},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/ITVS_2022_1_a4/}
}
TY  - JOUR
AU  - E. G. Zhilyakov
AU  - S. P. Belov
AU  - A. S. Belov
AU  - A. A. Medvedeva
TI  - Detection of pauses between word fragments of speech recordings
JO  - Informacionnye tehnologii i vyčislitelnye sistemy
PY  - 2022
SP  - 40
EP  - 46
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/ITVS_2022_1_a4/
LA  - ru
ID  - ITVS_2022_1_a4
ER  - 
%0 Journal Article
%A E. G. Zhilyakov
%A S. P. Belov
%A A. S. Belov
%A A. A. Medvedeva
%T Detection of pauses between word fragments of speech recordings
%J Informacionnye tehnologii i vyčislitelnye sistemy
%D 2022
%P 40-46
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/ITVS_2022_1_a4/
%G ru
%F ITVS_2022_1_a4
E. G. Zhilyakov; S. P. Belov; A. S. Belov; A. A. Medvedeva. Detection of pauses between word fragments of speech recordings. Informacionnye tehnologii i vyčislitelnye sistemy, no. 1 (2022), pp. 40-46. http://geodesic.mathdoc.fr/item/ITVS_2022_1_a4/