Improving IdSay: a characterization of strengths and weaknesses in Question Answering systems for Portuguese

DI Group Author

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

IdSay is a Question Answering system for Portuguese that participated at QA@CLEF 2008 with a baseline version (IdSayBL). Despite the encouraging results, there was still much room for improvement. The participation of six systems in the Portuguese task, with very good results either individually or in an hypothetical combination run, provided a valuable source of information. We made an analysis of all the answers submitted by all systems to identify their strengths and weaknesses. We used the conclusions of that analysis to guide our improvements, keeping in mind the two key characteristics we want for the system: efficiency in terms of response time and robustness to treat different types of data. As a result, an improved version of IdSay was developed, including as the most important enhancement the introduction of semantic information. We obtained significantly better results, from an accuracy in the first answer of 32.5\% in IdSayBL to 50.5\% in IdSay, without degradation of response time.
Original languageUnknown
Title of host publicationLecture Notes in Computer Science
PublisherSpringer-Verlag
Pages1-10
DOIs
Publication statusPublished - 1 Jan 2010
EventComputational Processing of the Portuguese Language (PROPOR) -
Duration: 1 Jan 2010 → …

Conference

ConferenceComputational Processing of the Portuguese Language (PROPOR)
Period1/01/10 → …

Cite this