Blending of predictions boosts understanding for multimodal advertisements
Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part II–1, Tome 529 (2023), pp. 176-196

Voir la notice de l'article provenant de la source Math-Net.Ru

The advertising industry employs several content modalities to deliver implied messages: images, videos, text, music, and all of them combined. “Decoding” a message implied by multimodal content often requires both text and visual components. We study the tasks of multimodal symbolism prediction, topic detection, and sentiment type classification. Motivated by the difference in parts of the message conveyed by two modalities in advertisements, we train separate models for images and texts and significantly improve upon current state of the art by blending image- and text-based predictions (with OCR-extracted text), providing a comprehensive experimental validation of our approach.
@article{ZNSL_2023_529_a11,
     author = {A. Alekseev and A. Savchenko and E. Tutubalina and E. Myasnikov and S. Nikolenko},
     title = {Blending of predictions boosts understanding for multimodal advertisements},
     journal = {Zapiski Nauchnykh Seminarov POMI},
     pages = {176--196},
     publisher = {mathdoc},
     volume = {529},
     year = {2023},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/ZNSL_2023_529_a11/}
}
TY  - JOUR
AU  - A. Alekseev
AU  - A. Savchenko
AU  - E. Tutubalina
AU  - E. Myasnikov
AU  - S. Nikolenko
TI  - Blending of predictions boosts understanding for multimodal advertisements
JO  - Zapiski Nauchnykh Seminarov POMI
PY  - 2023
SP  - 176
EP  - 196
VL  - 529
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/ZNSL_2023_529_a11/
LA  - en
ID  - ZNSL_2023_529_a11
ER  - 
%0 Journal Article
%A A. Alekseev
%A A. Savchenko
%A E. Tutubalina
%A E. Myasnikov
%A S. Nikolenko
%T Blending of predictions boosts understanding for multimodal advertisements
%J Zapiski Nauchnykh Seminarov POMI
%D 2023
%P 176-196
%V 529
%I mathdoc
%U http://geodesic.mathdoc.fr/item/ZNSL_2023_529_a11/
%G en
%F ZNSL_2023_529_a11
A. Alekseev; A. Savchenko; E. Tutubalina; E. Myasnikov; S. Nikolenko. Blending of predictions boosts understanding for multimodal advertisements. Zapiski Nauchnykh Seminarov POMI, Investigations on applied mathematics and informatics. Part II–1, Tome 529 (2023), pp. 176-196. http://geodesic.mathdoc.fr/item/ZNSL_2023_529_a11/