Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case

Fahad Khan, Laurent Romary, Ana Salgado, Jack Bowers, Mohamed Khemakhem, Toma Tasovac

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

In this article, we will introduce two of the new parts of the new multi-part version of the Lexical Markup Framework (LMF) ISO standard, namely Part 3 of the standard (ISO 24613-3), which deals with etymological and diachronic data, and Part 4 (ISO 24613-4), which consists of a TEI serialisation of all of the prior parts of the model. We will demonstrate the use of both standards by describing the LMF encoding of a small number of examples taken from a sample conversion of the reference Portuguese dictionary Grande Dicionário Houaiss da Língua Portuguesa, part of a broader experiment comprising the analysis of different, heterogeneously encoded, Portuguese lexical resources. We present the examples in the Unified Modelling Language (UML) and also in a couple of cases in TEI.
Original language: English
Title of host publication: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
Editors: Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Place of Publication: Paris
Publisher: European Language Resources Association (ELRA)
Number of pages: 8
Publication status: Published - 2020
Event: Language Resources and Evaluation Conference - Le Palais du Pharo, Marseilles, France
Duration: 13 May 2020 to 15 May 2020
Conference number: 12th


Conference: Language Resources and Evaluation Conference
Abbreviated title: LREC 2020

  • LMF
  • TEI
  • Portuguese Language Resources
  • Dictionaries

