Methods of speech and text databases development for QA-systems
    
    
  
  
  
      
      
      
        
Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematika, mehanika, fizika, Tome 10 (2018) no. 3, pp. 59-66
    
  
  
  
  
  
    
      
      
        
      
      
      
    Voir la notice de l'article provenant de la source Math-Net.Ru
            
              			The paper is devoted to the problems of question-answer systems development (QA-systems). The subject of the study is discussion of approaches to the automatic filling of the database of the QA-system based on the analysis of the unstructured text sources currently available in the public domain of the Internet. 
The analysis reveals that the following ways of implementing QA-systems are distinguished: based on inference for ontologies, rules and syntax, using artificial neural networks. 
The methods for automatically search of question-answer pairs based on the structure of sentences and on the basis of associative-ontological analysis has been developed and tested in the research. 
The method based on the analysis of the structure of sentences is effective for texts such as lists of frequently asked questions (FAQ), as well as literature texts containing dialogs, direct speech, based on preliminary processing of the text, expressed in the form of a heuristic rule. 
The method based on associative-ontological analysis is focused to the class of reference and dictionary texts and is based on the assumption that in the descriptive text there is a sentence (or a group of sentences) containing the main idea of the text. In this case, the title of the text can be considered a question, and this sentence (or a group of sentences) is the answer. We need to make the selection of meaning-generating sentences due to the semantic reduction of the text automation. For this purpose, algorithms of self-referencing are applied based on the associative-ontological approach to the processing of texts in natural language. 
For the experimental verification of the possibility of creating an open QA-system based on the automatic collection of question-answer pairs from the Internet, a prototype of a collection module for the database of the QA-system has been developed.
			
            
            
            
          
        
      
                  
                    
                    
                    
                        
Keywords: 
question-answer pair, associative-ontological analysis, text, automatic text processing, natural language, speech recognition.
                    
                    
                    
                  
                
                
                @article{VYURM_2018_10_3_a6,
     author = {A. L. Ronzhin and A. A. Zaytseva and S. V. Kuleshov and K. V. Nenausnikov},
     title = {Methods of speech and text databases development for {QA-systems}},
     journal = {Vestnik \^U\v{z}no-Uralʹskogo gosudarstvennogo universiteta. Seri\^a, Matematika, mehanika, fizika},
     pages = {59--66},
     publisher = {mathdoc},
     volume = {10},
     number = {3},
     year = {2018},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/VYURM_2018_10_3_a6/}
}
                      
                      
                    TY - JOUR AU - A. L. Ronzhin AU - A. A. Zaytseva AU - S. V. Kuleshov AU - K. V. Nenausnikov TI - Methods of speech and text databases development for QA-systems JO - Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematika, mehanika, fizika PY - 2018 SP - 59 EP - 66 VL - 10 IS - 3 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/VYURM_2018_10_3_a6/ LA - en ID - VYURM_2018_10_3_a6 ER -
%0 Journal Article %A A. L. Ronzhin %A A. A. Zaytseva %A S. V. Kuleshov %A K. V. Nenausnikov %T Methods of speech and text databases development for QA-systems %J Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematika, mehanika, fizika %D 2018 %P 59-66 %V 10 %N 3 %I mathdoc %U http://geodesic.mathdoc.fr/item/VYURM_2018_10_3_a6/ %G en %F VYURM_2018_10_3_a6
A. L. Ronzhin; A. A. Zaytseva; S. V. Kuleshov; K. V. Nenausnikov. Methods of speech and text databases development for QA-systems. Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematika, mehanika, fizika, Tome 10 (2018) no. 3, pp. 59-66. http://geodesic.mathdoc.fr/item/VYURM_2018_10_3_a6/
