Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на этот ресурс:
http://elar.urfu.ru/handle/10995/130299
Полная запись метаданных
Поле DC | Значение | Язык |
---|---|---|
dc.contributor.author | Efimov, P. | en |
dc.contributor.author | Boytsov, L. | en |
dc.contributor.author | Arslanova, E. | en |
dc.contributor.author | Braslavski, P. | en |
dc.date.accessioned | 2024-04-05T16:18:01Z | - |
dc.date.available | 2024-04-05T16:18:01Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Efimov, P, Boytsov, L, Arslanova, E & Braslavski, P 2023, The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer: book chapter. в J Kamps & L Goeuriot (ред.), Advances in Information Retrieval: 45th European Conference on Information Retrieval: book. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Том. 13982, Springer Cham, стр. 51-67. https://doi.org/10.1007/978-3-031-28241-6_4 | harvard_pure |
dc.identifier.citation | Efimov, P., Boytsov, L., Arslanova, E., & Braslavski, P. (2023). The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer: book chapter. в J. Kamps, & L. Goeuriot (Ред.), Advances in Information Retrieval: 45th European Conference on Information Retrieval: book (стр. 51-67). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Том 13982). Springer Cham. https://doi.org/10.1007/978-3-031-28241-6_4 | apa_pure |
dc.identifier.isbn | 9783031282409 | - |
dc.identifier.issn | 0302-9743 | - |
dc.identifier.other | Final | 2 |
dc.identifier.other | All Open Access, Green | 3 |
dc.identifier.other | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85151051828&doi=10.1007%2f978-3-031-28241-6_4&partnerID=40&md5=6cf730ccd4a5490b5051ddd822241e89 | 1 |
dc.identifier.other | https://arxiv.org/pdf/2204.06457 | |
dc.identifier.uri | http://elar.urfu.ru/handle/10995/130299 | - |
dc.description.abstract | Large multilingual language models such as mBERT or XLM-R enable zero-shot cross-lingual transfer in various IR and NLP tasks. Cao et al. [8] proposed a data- and compute-efficient method for cross-lingual adjustment of mBERT that uses a small parallel corpus to make embeddings of related words across languages similar to each other. They showed it to be effective in NLI for five European languages. In contrast we experiment with a topologically diverse set of languages (Spanish, Russian, Vietnamese, and Hindi) and extend their original implementations to new tasks (XSR, NER, and QA) and an additional training regime (continual learning). Our study reproduced gains in NLI for four languages, showed improved NER, XSR, and cross-lingual QA results in three languages (though some cross-lingual QA gains were not statistically significant), while mono-lingual QA performance never improved and sometimes degraded. Analysis of distances between contextualized embeddings of related and unrelated words (across languages) showed that fine-tuning leads to “forgetting” some of the cross-lingual alignment information. Based on this observation, we further improved NLI performance using continual learning. Our software is publicly available https://github.com/pefimov/cross-lingual-adjustment. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG. | en |
dc.description.sponsorship | Russian Science Foundation, RSF: 20-11-20166 | en |
dc.description.sponsorship | Acknowledgment. This research was supported in part through computational resources of HPC facilities at HSE University [27]. PE is grateful to Yandex Cloud for their grant toward computing resources of Yandex DataSphere. PB acknowledges support by the Russian Science Foundation, grant No 20-11-20166. | en |
dc.format.mimetype | application/pdf | en |
dc.language.iso | en | en |
dc.publisher | Springer Science and Business Media Deutschland GmbH | en |
dc.relation | info:eu-repo/grantAgreement/RSF//20-11-20166 | en |
dc.rights | info:eu-repo/semantics/openAccess | en |
dc.source | Advances in Information Retrieval | 2 |
dc.source | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en |
dc.subject | CROSS-LINGUAL TRANSFER | en |
dc.subject | MULTILINGUAL EMBEDDINGS | en |
dc.subject | NATURAL LANGUAGE PROCESSING SYSTEMS | en |
dc.subject | ZERO-SHOT LEARNING | en |
dc.subject | CONTEXTUAL WORDS | en |
dc.subject | CONTINUAL LEARNING | en |
dc.subject | CROSS-LINGUAL | en |
dc.subject | CROSS-LINGUAL TRANSFER | en |
dc.subject | EMBEDDINGS | en |
dc.subject | LANGUAGE MODEL | en |
dc.subject | MULTILINGUAL EMBEDDING | en |
dc.subject | PARALLEL CORPORA | en |
dc.subject | PERFORMANCE | en |
dc.subject | WORD REPRESENTATIONS | en |
dc.subject | EMBEDDINGS | en |
dc.title | The Impact of Cross-Lingual Adjustment of Contextual Word Representations on Zero-Shot Transfer | en |
dc.type | Conference Paper | en |
dc.type | info:eu-repo/semantics/conferenceObject | en |
dc.type | info:eu-repo/semantics/submittedVersion | en |
dc.conference.name | 45th European Conference on Information Retrieval, ECIR 2023 | en |
dc.conference.date | 2 April 2023 through 6 April 2023 | - |
dc.identifier.doi | 10.1007/978-3-031-28241-6_4 | - |
dc.identifier.scopus | 85151051828 | - |
local.contributor.employee | Efimov, P., ITMO University, Saint Petersburg, Russian Federation | en |
local.contributor.employee | Boytsov, L., Bosch Center for Artificial Intelligence, Pittsburgh, United States | en |
local.contributor.employee | Arslanova, E., Ural Federal University, Yekaterinburg, Russian Federation | en |
local.contributor.employee | Braslavski, P., Ural Federal University, Yekaterinburg, Russian Federation, HSE University, Moscow, Russian Federation | en |
local.description.firstpage | 51 | - |
local.description.lastpage | 67 | - |
local.volume | 13982 LNCS | - |
dc.identifier.wos | 000995495200004 | - |
local.contributor.department | ITMO University, Saint Petersburg, Russian Federation | en |
local.contributor.department | Bosch Center for Artificial Intelligence, Pittsburgh, United States | en |
local.contributor.department | Ural Federal University, Yekaterinburg, Russian Federation | en |
local.contributor.department | HSE University, Moscow, Russian Federation | en |
local.identifier.pure | 37140299 | - |
local.identifier.eid | 2-s2.0-85151051828 | - |
local.fund.rsf | 20-11-20166 | - |
local.identifier.wos | WOS:000995495200004 | - |
Располагается в коллекциях: | Научные публикации ученых УрФУ, проиндексированные в SCOPUS и WoS CC |
Файлы этого ресурса:
Файл | Описание | Размер | Формат | |
---|---|---|---|---|
2-s2.0-85151051828.pdf | 323 kB | Adobe PDF | Просмотреть/Открыть |
Все ресурсы в архиве электронных ресурсов защищены авторским правом, все права сохранены.