TY - JOUR
T1 - Linguistic resources for paraphrase generation in Portuguese
T2 - a lexicon-grammar approach
AU - Barreiro, Anabela
AU - Mota, Cristina
AU - Baptista, Jorge
AU - Chacoto, Lucília
AU - Carvalho, Paula
N1 - info:eu-repo/grantAgreement/FCT/3599-PPCDT/EXPL%2FMHC-LIN%2F2260%2F2013/PT#
info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F50021%2F2020/PT#
info:eu-repo/grantAgreement/FCT/OE/SFRH%2FBPD%2F91446%2F2012/PT#
info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00657%2F2020/PT#
info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F00657%2F2020/PT#
EXPL/MHC-LIN/2260/2013
UIDB/50021/2020
UTAP EXPL/EEI-ESS/0031/2014
SFRH/BPD/91446/2012
UIDB/00657/2020
UIDP/00657/2020
PY - 2022/3
Y1 - 2022/3
AB - This paper presents a new linguistic resource for the generation of paraphrases in Portuguese, based on the lexicon-grammar framework. The resource has three components: (i) a lexicon-grammar-based dictionary of 2100 predicate nouns co-occurring with the support verb ser de ‘be of’, as in ser de uma ajuda inestimável ‘be of invaluable help’; (ii) a lexicon-grammar-based dictionary of 6000 predicate nouns co-occurring with the support verb fazer ‘do’ or ‘make’, as in fazer uma comparação ‘make a comparison’; and (iii) a lexicon-grammar-based dictionary of about 5000 human intransitive adjectives co-occurring with the copula verbs ser and/or estar ‘be’, as in ser simpático ‘be kind’ or estar entusiasmado ‘be enthusiastic’. A set of local grammars explores the properties described in these linguistic resources, enabling a variety of text transformation tasks for paraphrasing applications. The paper highlights the complementary and synergistic components and the efforts to integrate them, and presents preliminary evaluation results on the inclusion of these resources in the eSPERTo paraphrase generation system.
KW - Language Generation Resources
KW - Lexicon-Grammar
KW - Paraphrase Generator
KW - Paraphrasing
KW - Portuguese
UR - http://www.scopus.com/inward/record.url?scp=85123496680&partnerID=8YFLogxK
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000746292500001
U2 - 10.1007/s10579-021-09561-5
DO - 10.1007/s10579-021-09561-5
M3 - Article
AN - SCOPUS:85123496680
SN - 1574-020X
VL - 56
SP - 1
EP - 35
JO - Language Resources and Evaluation
JF - Language Resources and Evaluation
IS - 1
ER -