Please use this identifier to cite or link to this item: http://hdl.handle.net/10995/3058
Title: Could we automatically reproduce semantic relations of an information retrieval thesaurus?
Authors: Panchenko, A.
Issue Date: 2010
Publisher: Издательско-полиграфический центр Воронежского государственного университета
Citation: Panchenko, A. Could we automatically reproduce semantic relations of an information retrieval thesaurus? / A. Panchenko // IV Российская летняя школа по информационному поиску RuSSIR’2010, 13-18 сентября 2010 г. : труды Четвертой Российской конференции молодых ученых по информационному поиску. — Воронеж : Издательско-полиграфический центр Воронежского государственного университета, 2010. — С. 36-51.
Abstract: A well constructed thesaurus is recognized as a valuable source of semantic information for various applications, especially for Information Retrieval. The main hindrances to using thesaurus-oriented approaches are the high complexity and cost of manual thesauri creation. This paper addresses the problem of automatic thesaurus construction, namely we study the quality of automatically extracted semantic relations as compared with the semantic relations of a manually crafted thesaurus. The vector-space model based on syntactic contexts was used to reproduce relations between the terms of a manually constructed thesaurus. We propose a simple algorithm for representing both single word and multiword terms in the distributional space of syntactic contexts. Furthermore, we propose a method for evaluation quality of the extracted relations. Our experiments show significant difference between the automatically and manually constructed relations: while many of the automatically generated relations are relevant, just a small part of them could be found in the original thesaurus.
Keywords: THESAURUS
SEMANTIC RELATIONS
VECTOR-SPACE MODEL
DISTRIBUTIONAL ANALYSIS
MULTIWORD EXPRESSIONS
URI: http://hdl.handle.net/10995/3058
http://elar.urfu.ru/handle/10995/3058
Conference name: IV Russian Summer School in Information Retrieval (RuSSIR’2010)
IV Российская летняя школа по информационному поиску (RuSSIR’2010)
Conference date: 13.09.2010-18.09.2010
ISBN: 978-5-9273-1728-8
Origin: IV Российская летняя школа по информационному поиску RuSSIR’2010, 13-18 сентября 2010 г. : труды Четвертой Российской конференции молодых ученых по информационному поиску
Appears in Collections:Информационный поиск

Files in This Item:
File Description SizeFormat 
russir-2010-04.pdf1,57 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.