Please use this identifier to cite or link to this item: http://elar.urfu.ru/handle/10995/4557
Title: Web page multi-label classification for filtering content from the web
Authors: Hresko, Juraj
Issue Date: 2012
Citation: Hresko J. Web page multi-label classification for filtering content from the web / J. Hresko // VI Russian Summer School in Information Retrieval, August 6–10, 2012. Proceedings of the Sixth Russian Young Scientists Conference in Information Retrieval / B. Sokolov, P. Braslavski (Eds.). — Yaroslavl, 2012. — P. 16-23.
Abstract: In this paper, we describe a simple approach to filtering unwanted web pages according to their content. The result of this work is a demo application that can be used both for real-time filtering and for non-real-time indexing of arbitrary web pages. We describe the proposed technique step by step and discuss possible alternatives for each part. Finally, we discuss the overall quality and propose next steps that could lead to a fully usable business application.
Keywords: COMPUTER SCIENCE
INFORMATION RETRIEVAL ON THE INTERNET
INFORMATION SEARCH ON THE INTERNET
CONFERENCES
MULTI-LABEL CLASSIFICATION
WEB MINING
MACHINE LEARNING
CONTENT FILTERING
URI: http://elar.urfu.ru/handle/10995/4557
Conference name: VI Russian Summer School in Information Retrieval (RuSSIR’2012)
Conference date: 6.08.2012–10.08.2012
ISBN: 978-5-8397-0888-4
Origin: RuSSIR 2012
Appears in Collections: Information Retrieval

Files in This Item:
File: RuSSIR_2012_02.pdf
Size: 139.01 kB
Format: Adobe PDF