A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Sina Ahmadi, John P. McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S. Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, Thomas Troelsgard, Sussi Olsen, Simon Krek, Veronika Lipp, Tamás Váradi, László Simon, András Gyórffy, Carole Tiberius, Tanneke Schoonheim, Yifat Ben MosheMaya Rudich, Raya Abu Ahmad, Dorielle Lonke, Kira Kovalenko, Margit Langemets, Jelena Kallas, Oksana Dereza, Theodorus Fransen, David Cillessen, David Lindemann, Mikel Alonso, Ana de Castro Salgado, José Luis Sancho, Rafael-J. Ureña-Ruiz, Jordi Porta Zamorano, Kiril Simov, Petya Osenova, Zara Kancheva, Ivaylo Radev, Ranka Ranka Stankovic, Andrej Perdih, Dejan Gabrovsek

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Downloads (Pure)

Abstract

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language process-ing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment iscarried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships suchas broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide rangeof languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data willpave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriouslyrequiring data such as neural networks. Our resources are publicly available athttps://github.com/elexis-eu/MWSA.
Original languageEnglish
Title of host publicationProceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
EditorsNicoletta Calzolari, Frédéric Béche, Philippe Blach, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Place of PublicationParis
PublisherEuropean Language Resources Association (ELRA)
Pages3232–3242
Number of pages10
ISBN (Print)979-10-95546-36-8
Publication statusPublished - 2020
EventLanguage Resources and Evaluation Conference - Le Palais du Pharo, Marseilles, France
Duration: 13 May 202015 May 2020
Conference number: 12th
https://lrec2020.lrec-conf.org/en/

Conference

ConferenceLanguage Resources and Evaluation Conference
Abbreviated titleLREC 2020
CountryFrance
CityMarseilles
Period13/05/2015/05/20
Internet address

Keywords

  • Lexical semantics resoruces
  • Sense alignment
  • Lexicography
  • Language resource

Fingerprint

Dive into the research topics of 'A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment'. Together they form a unique fingerprint.

Cite this