Improving the presentation of search results by multipartite graph clustering of multiple reformulated queries and a novel document representation

Lytkin, N.; Streltsov, S.; Perlovsky, L.; Muchnik, I.; Petrov, S.

Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс: http://elar.urfu.ru/handle/10995/59607

Название:	Improving the presentation of search results by multipartite graph clustering of multiple reformulated queries and a novel document representation
Авторы:	Lytkin, N. Streltsov, S. Perlovsky, L. Muchnik, I. Petrov, S.
Дата публикации:	2005
Издатель:	б. и.
Библиографическое описание:	Improving the presentation of search results by multipartite graph clustering of multiple reformulated queries and a novel document representation / N. Lytkin, S. Streltsov, L. Perlovsky, I. Muchnik, S. Petrov // Интернет-математика 2005. Автоматическая обработка веб-данных. — М., 2005.
Аннотация:	The goal of clustering web search results is to reveal the semantics of the retrieved documents. The main challenge is to make clustering partition relevant to a user’s query. In this paper, we describe a method of clustering search results using a similarity measure between documents retrieved by multiple reformulated queries. The method produces clusters of documents that are most relevant to the original query and, at the same time, represent a more diverse set of semantically related queries. In order to cluster thousands of documents in real time, we designed a novel multipartite graph clustering algorithm that has low polynomial complexity and no manually adjusted hyper–parameters. The loss of semantics resulting from the stem–based document representation is a common problem in information retrieval. To address this problem, we propose an alternative novel document representation, under which words are represented by their synonymy groups.
URI:	http://elar.urfu.ru/handle/10995/59607
Сведения о поддержке:	This work was supported by Yandex grant 110104.
Источники:	Интернет-математика 2005: автоматическая обработка веб-данных. — М., 2005
Располагается в коллекциях:	Информационный поиск

Файлы этого ресурса:

Файл	Описание	Размер	Формат
IMAT_2005_26.pdf		220,48 kB	Adobe PDF	Просмотреть/Открыть

Показать полное описание ресурса Статистика

Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.