Linguistic resources for paraphrase generation in portuguese: a lexicon-grammar approach

Anabela Barreiro, Cristina Mota, Jorge Baptista, Lucília Chacoto, Paula Carvalho

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

This paper presents a new linguistic resource for the generation of paraphrases in Portuguese, based on the lexicon-grammar framework. The resource components include: (i) a lexicon-grammar based dictionary of 2100 predicate nouns co-occurring with the support verb ser de ‘be of’, such as in ser de uma ajuda inestimável ‘be of invaluable help’; (ii) a lexicon-grammar based dictionary of 6000 predicate nouns co-occurring with the support verb fazer ‘do’ or ‘make’, such as in fazer uma comparação ‘make a comparison’; and (iii) a lexicon-grammar based dictionary of about 5000 human intransitive adjectives co-occurring with the copula verbs ser and/or estar ‘be’, such as in ser simpático ‘be kind’ or estar entusiasmado ‘be enthusiastic’. A set of local grammars explore the properties described in linguistic resources, enabling a variety of text transformation tasks for paraphrasing applications. The paper highlights the different complementary and synergistic components and integration efforts, and presents some preliminary evaluation results on the inclusion of such resources in the eSPERTo paraphrase generation system.
Original languageEnglish
Pages (from-to)1-35
Number of pages35
JournalLanguage Resources and Evaluation
Volume56
Issue number1
DOIs
Publication statusPublished - Mar 2022

Keywords

  • Language Generation Resources
  • Lexicon-Grammar
  • Paraphrase Generator
  • Paraphrasing
  • Portuguese

Fingerprint

Dive into the research topics of 'Linguistic resources for paraphrase generation in portuguese: a lexicon-grammar approach'. Together they form a unique fingerprint.

Cite this