Extracting knowledge-rich information from definitions: A corpus-based approach to building a conceptual-based terminological resource

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

38 Downloads (Pure)

Abstract

This paper aims to describe a text-mining approach on a domain corpus (cork) within the theoretical framework of the dual dimension of terminology to create a terminological dictionary and correlate it with an ontology. We will make some considerations on (i) domain specificities; (ii) lexical markers; (iii) automatic corpus processing using Sketch Engine; (iv) representation of lexical networks using CmapTools; and (v) representation of the concept system using Protégé. The goal of the ontology is to logically support the coherence and quality of the natural language definitions contained in the terminological resource.
Original languageEnglish
Title of host publicationProceedings of the 2nd International Conference on Multilingual Digital Terminology Today (MDTT 2023)
EditorsGiorgio Di Nunzio, Rute Costa, Federica Vezzani
PublisherRWTH Aachen University
Pages1-14
Number of pages14
Publication statusPublished - Jul 2023
EventMultilingual digital terminology today. Design, representation formats and management systems (MDTT 2023) - NOVA Medical School, Lisboa, Portugal
Duration: 29 Jun 202330 Jun 2023
http://mdtt2023.dei.unipd.it/en/

Publication series

NameCEUR Workshop Proceedings
PublisherRWTH Aachen University
Volume3427
ISSN (Electronic)1613-0073

Conference

ConferenceMultilingual digital terminology today. Design, representation formats and management systems (MDTT 2023)
Country/TerritoryPortugal
CityLisboa
Period29/06/2330/06/23
Internet address

Keywords

  • Terminology
  • Terminology Management
  • Corpus linguistics
  • Text mining
  • Knowledge organization
  • Ontology

Fingerprint

Dive into the research topics of 'Extracting knowledge-rich information from definitions: A corpus-based approach to building a conceptual-based terminological resource'. Together they form a unique fingerprint.

Cite this