A Document Descriptor Extractor Based on Relevant Expressions

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)


People are often asked to associate keywords to documents to enable applications to access the summarized core content of documents. This fact was the main motivation to work on an approach that may contribute to move from this manual procedure to an automatic one. Since Relevant Expressions (REs) or multi-word term expressions can be automatically extracted using the LocalMaxs algorithm, the most relevant ones can be used to describe the core content of each document. In this paper we present a language-independent approach for automatic generation of document descriptors. Results are shown for three different European languages and comparisons are made concerning different metrics for selecting the most informative REs of each document.
Original languageUnknown
Title of host publicationLecture Notes in Computer Science
ISBN (Print)978-3-642-04685-8
Publication statusPublished - 1 Jan 2009
EventEPIA 2009, Portuguese Conference on Artificial Inteligence -
Duration: 1 Jan 2009 → …


ConferenceEPIA 2009, Portuguese Conference on Artificial Inteligence
Period1/01/09 → …

Cite this