Please use this identifier to cite or link to this item: http://elar.urfu.ru/handle/10995/3708
Full metadata record
DC Field | Value | Language
dc.contributor.author | Ermakova, L. | en
dc.date.accessioned | 2011-10-12T09:31:27Z | -
dc.date.available | 2011-10-12T09:31:27Z | -
dc.date.issued | 2011 | -
dc.identifier.citation | Ermakova L. Transforming Message Detection / L. Ermakova // Web of Data: The joint RuSSIR/EDBT 2011 Summer School, August 15–19, 2011, Proceedings of the Fifth Russian Young Scientists Conference in Information Retrieval / B. Novikov, P. Braslavsky (Eds.). – St. Petersburg, 2011. – P. 15-29. | ru
dc.identifier.isbn | 978-5-288-05225-5 | -
dc.identifier.uri | http://elar.urfu.ru/handle/10995/3708 | -
dc.description.abstract | The majority of existing spam filtering techniques suffer from several serious disadvantages. Some of them produce many false positives. Others are suitable only for email filtering and cannot be used in IM and social networks. Therefore, content-based methods seem to be more efficient. One of them is based on signature retrieval; however, it is not resistant to changes in the message. There are enhancements (e.g. checksums), but they are extremely time- and resource-consuming. That is why the main objective of this research is to develop a method for detecting transforming messages. To this end, we have compared spam in various languages, namely English, French, Russian and Italian. For each language, about 1000 messages, including spam and non-spam, were examined. 135 quantitative features have been retrieved. Almost all of these features are language-independent. They underlie the first step of the algorithm, which is based on a support vector machine. The next stage is to test the obtained results by applying an N-gram approach. Special attention is paid to word distortion and text alteration. The obtained results indicate the efficiency of the suggested approach. | ru
dc.format.extent | 437949 bytes | en
dc.format.mimetype | application/pdf | en
dc.language.iso | en | en
dc.publisher | St. Petersburg University Press | ru
dc.relation.ispartof | RuSSIR/EDBT2011 | en
dc.subject | SPAM | en
dc.subject | TRANSFORMING MESSAGE | en
dc.subject | N-GRAMS | en
dc.subject | SVM | en
dc.subject | DAMERAU-LEVENSHTEIN DISTANCE | en
dc.title | Transforming Message Detection | en
dc.type | Article | en
dc.type | info:eu-repo/semantics/article | en
dc.type | info:eu-repo/semantics/publishedVersion | en
dc.conference.name | V Russian Summer School in Information Retrieval (RuSSIR’2011) | en
dc.conference.name | V Российская летняя школа по информационному поиску (RuSSIR’2011) | ru
dc.conference.name | EDBT Summer Schools | en
dc.conference.date | 15.08.2011–19.08.2011 | -
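
The abstract and subject keywords above mention N-grams and the Damerau-Levenshtein distance in connection with word distortion. This record does not describe how the paper combines them, so the following is only a minimal Python sketch of the two measures themselves, not the author's method. The function names (`osa_distance`, `char_ngrams`, `ngram_similarity`) and the choice of character trigrams with Jaccard overlap are assumptions made for illustration; the distance shown is the restricted "optimal string alignment" variant of Damerau-Levenshtein.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance (restricted Damerau-Levenshtein).

    Counts insertions, deletions, substitutions and transpositions of
    adjacent characters, each with cost 1.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters of a
    for j in range(cols):
        d[0][j] = j  # insert all j characters of b
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution or match
            )
            # Transposition of two adjacent characters.
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2]
                    and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[rows - 1][cols - 1]


def char_ngrams(text: str, n: int = 3) -> set:
    """Set of lowercase character n-grams of text (illustrative choice)."""
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def ngram_similarity(a: str, b: str, n: int = 3) -> float:
    """Jaccard overlap of the character n-gram sets of a and b."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a and not grams_b:
        return 1.0  # both texts too short to yield any n-gram
    return len(grams_a & grams_b) / len(grams_a | grams_b)
```

For instance, `osa_distance("viagra", "vaigra")` is 1, because the two spellings differ by a single swap of adjacent letters; a plain Levenshtein distance would count that distortion as two substitutions.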
Appears in collections: Information Retrieval

Files in this item:
File | Description | Size | Format
RuSSIR_2011_02.pdf | | 427.68 kB | Adobe PDF


All items in the electronic resource archive are protected by copyright, with all rights reserved.