Towards Russian summarization: can architecture solve data limitations problems?
Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part IV, Tome 540 (2024), pp. 5-26

Voir la notice de l'article provenant de la source Math-Net.Ru

In this work, we investigate the automatic summarization problem, focusing on its significance, challenges, and methodologies, particularly in the context of the Russian language. We highlight the limitations of current evaluation metrics and datasets, representing diverse summarization scenarios. We study various approaches, including the formats of supervised fine-tuning, a comparison of models designed for Russian and those with cross-lingual capabilities, and the influence of reinforcement learning alignment on the final results. Contributions of this work include an examination of the summarization task for the Russian language, publication of a new instruction-based dataset and the best open-source model, and insights for further advances in the field.
@article{ZNSL_2024_540_a0,
     author = {A. Akhmetgareeva and A. Abramov and I. Kuleshov and V. Leschuk and A. Fenogenova},
     title = {Towards {Russian} summarization: can architecture solve data limitations problems?},
     journal = {Zapiski Nauchnykh Seminarov POMI},
     pages = {5--26},
     publisher = {mathdoc},
     volume = {540},
     year = {2024},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/}
}
TY  - JOUR
AU  - A. Akhmetgareeva
AU  - A. Abramov
AU  - I. Kuleshov
AU  - V. Leschuk
AU  - A. Fenogenova
TI  - Towards Russian summarization: can architecture solve data limitations problems?
JO  - Zapiski Nauchnykh Seminarov POMI
PY  - 2024
SP  - 5
EP  - 26
VL  - 540
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/
LA  - en
ID  - ZNSL_2024_540_a0
ER  - 
%0 Journal Article
%A A. Akhmetgareeva
%A A. Abramov
%A I. Kuleshov
%A V. Leschuk
%A A. Fenogenova
%T Towards Russian summarization: can architecture solve data limitations problems?
%J Zapiski Nauchnykh Seminarov POMI
%D 2024
%P 5-26
%V 540
%I mathdoc
%U http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/
%G en
%F ZNSL_2024_540_a0
A. Akhmetgareeva; A. Abramov; I. Kuleshov; V. Leschuk; A. Fenogenova. Towards Russian summarization: can architecture solve data limitations problems?. Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part IV, Tome 540 (2024), pp. 5-26. http://geodesic.mathdoc.fr/item/ZNSL_2024_540_a0/