Towards Russian summarization: can architecture solve data limitations problems?
    
    
  
  
  
      
      
      
        
Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part IV, Tome 540 (2024), pp. 5-26
    
  
  
  
  
  
    
      
      
        
      
      
      
See the article record from the source Math-Net.Ru
            
In this work, we investigate the automatic summarization problem, focusing on its significance, challenges, and methodologies, particularly in the context of the Russian language. We highlight the limitations of current evaluation metrics and datasets, representing diverse summarization scenarios. We study various approaches, including the formats of supervised fine-tuning, a comparison of models designed for Russian and those with cross-lingual capabilities, and the influence of reinforcement learning alignment on the final results. Contributions of this work include an examination of the summarization task for the Russian language, publication of a new instruction-based dataset and the best open-source model, and insights for further advances in the field.
			
            
            
            
          
        
@article{ZNSL_2024_540_a0,
     author = {A. Akhmetgareeva and A. Abramov and I. Kuleshov and V. Leschuk and A. Fenogenova},
     title = {Towards {Russian} summarization: can architecture solve data limitations problems?},
     journal = {Zapiski Nauchnykh Seminarov POMI},
     pages = {5--26},
     publisher = {mathdoc},
     volume = {540},
     year = {2024},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/}
}
                      
                      
TY  - JOUR
AU  - A. Akhmetgareeva
AU  - A. Abramov
AU  - I. Kuleshov
AU  - V. Leschuk
AU  - A. Fenogenova
TI  - Towards Russian summarization: can architecture solve data limitations problems?
JO  - Zapiski Nauchnykh Seminarov POMI
PY  - 2024
SP  - 5
EP  - 26
VL  - 540
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/
LA  - en
ID  - ZNSL_2024_540_a0
ER  -
%0 Journal Article
%A A. Akhmetgareeva
%A A. Abramov
%A I. Kuleshov
%A V. Leschuk
%A A. Fenogenova
%T Towards Russian summarization: can architecture solve data limitations problems?
%J Zapiski Nauchnykh Seminarov POMI
%D 2024
%P 5-26
%V 540
%I mathdoc
%U http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/
%G en
%F ZNSL_2024_540_a0
A. Akhmetgareeva; A. Abramov; I. Kuleshov; V. Leschuk; A. Fenogenova. Towards Russian summarization: can architecture solve data limitations problems? Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part IV, Tome 540 (2024), pp. 5-26. http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/