Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс: http://elar.urfu.ru/handle/10995/103284
Название: RuBQ: A Russian Dataset for Question Answering over Wikidata
Авторы: Korablinov, V.
Braslavski, P.
Дата публикации: 2020
Издатель: Springer Science and Business Media Deutschland GmbH
Библиографическое описание: Korablinov V. RuBQ: A Russian Dataset for Question Answering over Wikidata / V. Korablinov, P. Braslavski. — DOI 10.1007/978-3-030-62466-8_7 // Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). — 2020. — Vol. 12507 LNCS. — P. 97-110.
Аннотация: The paper presents RuBQ, the first Russian knowledge base question answering (KBQA) dataset. The high-quality dataset consists of 1,500 Russian questions of varying complexity, their English machine translations, SPARQL queries to Wikidata, reference answers, as well as a Wikidata sample of triples containing entities with Russian labels. The dataset creation started with a large collection of question-answer pairs from online quizzes. The data underwent automatic filtering, crowd-assisted entity linking, automatic generation of SPARQL queries, and their subsequent in-house verification. The freely available dataset will be of interest for a wide community of researchers and practitioners in the areas of Semantic Web, NLP, and IR, especially for those working on multilingual question answering. The proposed dataset generation pipeline proved to be efficient and can be employed in other data annotation projects. © 2020, Springer Nature Switzerland AG.
Ключевые слова: EVALUATION
KNOWLEDGE BASE QUESTION ANSWERING
RUSSIAN LANGUAGE RESOURCES
SEMANTIC PARSING
KNOWLEDGE BASED SYSTEMS
LARGE DATASET
NATURAL LANGUAGE PROCESSING SYSTEMS
AUTOMATIC FILTERING
AUTOMATIC GENERATION
DATA ANNOTATION
KNOWLEDGE BASE
MACHINE TRANSLATIONS
QUESTION ANSWERING
QUESTION-ANSWER PAIRS
SPARQL QUERIES
SEMANTIC WEB
URI: http://elar.urfu.ru/handle/10995/103284
Условия доступа: info:eu-repo/semantics/openAccess
Идентификатор SCOPUS: 85096596949
Идентификатор PURE: 20220236
0428da86-88a9-4f2e-8056-6bc739fa0a8e
ISSN: 3029743
ISBN: 9783030624651
DOI: 10.1007/978-3-030-62466-8_7
Сведения о поддержке: We thank Mikhail Galkin, Svitlana Vakulenko, Daniil Sorokin, Vladimir Kovalenko, Yaroslav Golubev, and Rishiraj Saha Roy for their valuable comments and fruitful discussion on the paper draft. We also thank Pavel Bakhvalov, who helped collect RuWikidata8M sample and contributed to the first version of the entity linking tool. We are grateful to Yandex.Toloka for their data annotation grant. PB acknowledges support by Ural Mathematical Center under agreement No. 075-02-2020-1537/1 with the Ministry of Science and Higher Education of the Russian Federation.
Располагается в коллекциях:Научные публикации, проиндексированные в SCOPUS и WoS CC

Файлы этого ресурса:
Файл Описание РазмерФормат 
2-s2.0-85096596949.pdf544,34 kBAdobe PDFПросмотреть/Открыть


Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.