Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс:
http://elar.urfu.ru/handle/10995/75000
Название: | Watset: Automatic induction of synsets from a graph of synonyms |
Авторы: | Ustalov, D. Panchenko, A. Biemann, C. |
Дата публикации: | 2017 |
Издатель: | Association for Computational Linguistics (ACL) |
Библиографическое описание: | Ustalov D. Watset: Automatic induction of synsets from a graph of synonyms / D. Ustalov, A. Panchenko, C. Biemann // ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers). — 2017. — Vol. 1. — P. 1579-1590. |
Аннотация: | This paper presents a new graph-based approach that induces synsets using synonymy dictionaries and word embeddings. First, we build a weighted graph of synonyms extracted from commonly available resources, such as Wiktionary. Second, we apply word sense induction to deal with ambiguous words. Finally, we cluster the disambiguated version of the ambiguous input graph into synsets. Our meta-clustering approach lets us use an efficient hard clustering algorithm to perform a fuzzy clustering of the graph. Despite its simplicity, our approach shows excellent results, outperforming five competitive state-of-the-art methods in terms of F-score on three gold standard datasets for English and Russian derived from large-scale manually constructed lexical resources. © 2017 Association for Computational Linguistics. |
Ключевые слова: | COMPUTATIONAL LINGUISTICS GRAPHIC METHODS LINGUISTICS SEMANTICS AUTOMATIC INDUCTION CLUSTERING APPROACH GOLD STANDARDS HARD CLUSTERING ALGORITHMS LEXICAL RESOURCES STATE-OF-THE-ART METHODS WEIGHTED GRAPH WORD SENSE INDUCTIONS CLUSTERING ALGORITHMS |
URI: | http://elar.urfu.ru/handle/10995/75000 |
Условия доступа: | info:eu-repo/semantics/openAccess |
Конференция/семинар: | 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017 |
Дата конференции/семинара: | 30 July 2017 through 4 August 2017 |
Идентификатор SCOPUS: | 85035077272 |
Идентификатор WOS: | 000493984800145 |
Идентификатор PURE: | 6430170 |
DOI: | 10.18653/v1/P17-1145 |
Сведения о поддержке: | We acknowledge the support of the Deutsche Forschungsgemeinschaft (DFG) foundation under the "JOIN-T" project, the DAAD, the RFBR under the project no. 16-37-00354 mol a, and the RFH under the project no. 16-04-12019. We also thank three anonymous reviewers for their helpful comments, Andrew Krizhanovsky for providing a parsed Wiktionary, Natalia Loukachevitch for the provided RuWordNet dataset, and Denis Shirgin who suggested the WATSET name. Amazon;Apple;Baidu;et al.;Google;Tencent |
Карточка проекта РНФ: | 16-04-12019 |
Располагается в коллекциях: | Научные публикации ученых УрФУ, проиндексированные в SCOPUS и WoS CC |
Файлы этого ресурса:
Файл | Описание | Размер | Формат | |
---|---|---|---|---|
10.18653-v1-p17-1145.pdf | 477,44 kB | Adobe PDF | Просмотреть/Открыть |
Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.