Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс: http://elar.urfu.ru/handle/10995/101728
Название: Proximity Full-Text Search by Means of Additional Indexes with Multi-component Keys: In Pursuit of Optimal Performance
Авторы: Veretennikov, A. B.
Дата публикации: 2019
Издатель: Springer Verlag
Библиографическое описание: Veretennikov A. B. Proximity Full-Text Search by Means of Additional Indexes with Multi-component Keys: In Pursuit of Optimal Performance / A. B. Veretennikov. — DOI 10.1007/978-3-030-23584-0_7 // Communications in Computer and Information Science. — 2019. — Vol. 1003. — P. 111-130.
Аннотация: Full-text search engines are important tools for information retrieval. In a proximity full-text search, a document is relevant if it contains query terms near each other, especially if the query terms are frequently occurring words. For each word in a text, we use additional indexes to store information about nearby words that are at distances from the given word of less than or equal to the MaxDistance parameter. We showed that additional indexes with three-component keys can be used to improve the average query execution time by up to 94.7 times if the queries consist of high-frequency occurring words. In this paper, we present a new search algorithm with even more performance gains. We consider several strategies for selecting multi-component key indexes for a specific query and compare these strategies with the optimal strategy. We also present the results of search experiments, which show that three-component key indexes enable much faster searches in comparison with two-component key indexes. © 2019, Springer Nature Switzerland AG.
Ключевые слова: ADDITIONAL INDEXES
FULL-TEXT SEARCH
INFORMATION RETRIEVAL
INVERTED INDEXES
PROXIMITY SEARCH
SEARCH ENGINES
TERM PROXIMITY
INFORMATION RETRIEVAL
ADDITIONAL INDEXES
FULL-TEXT SEARCH
INVERTED INDICES
PROXIMITY SEARCH
TERM PROXIMITY
SEARCH ENGINES
URI: http://elar.urfu.ru/handle/10995/101728
Условия доступа: info:eu-repo/semantics/openAccess
Идентификатор SCOPUS: 85069543644
Идентификатор PURE: 10263710
e8138734-bfe5-469b-9bc3-3b966bd9ff33
ISSN: 18650929
ISBN: 9783030235833
DOI: 10.1007/978-3-030-23584-0_7
Располагается в коллекциях:Научные публикации, проиндексированные в SCOPUS и WoS CC

Файлы этого ресурса:
Файл Описание РазмерФормат 
2-s2.0-85069543644.pdf1,12 MBAdobe PDFПросмотреть/Открыть


Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.