A detection method for mass-generated unnatural texts based on the
Numerical methods and programming, Tome 12 (2011) no. 3, pp. 58-72
Cet article a éte moissonné depuis la source Math-Net.Ru
Web spam is considered to be one of the greatest threats to modern search engines. Spammers use a wide range of algorithms to generate multiple unnatural texts. A new general model for texts generated from samples of natural texts is proposed. A new algorithm for detecting unnatural texts based on the topical structure analysis is also proposed. The proposed algorithm is evaluated on synthetic and real-world data.
Keywords:
web spam; topical structure; modeling.
@article{VMP_2011_12_3_a10,
author = {A. S. Pavlov and B. V. Dobrov},
title = {A detection method for mass-generated unnatural texts based on the},
journal = {Numerical methods and programming},
pages = {58--72},
year = {2011},
volume = {12},
number = {3},
language = {ru},
url = {http://geodesic.mathdoc.fr/item/VMP_2011_12_3_a10/}
}
A. S. Pavlov; B. V. Dobrov. A detection method for mass-generated unnatural texts based on the. Numerical methods and programming, Tome 12 (2011) no. 3, pp. 58-72. http://geodesic.mathdoc.fr/item/VMP_2011_12_3_a10/