People are often asked to associate keywords to documents to enable applications to access the summarized core content of documents. This fact was the main motivation to work on an approach that may contribute to move from this manual procedure to an automatic one. Since Relevant Expressions (REs) or multi-word term expressions can be automatically extracted using the LocalMaxs algorithm, the most relevant ones can be used to describe the core content of each document. In this paper we present a language-independent approach for automatic generation of document descriptors. Results are shown for three different European languages and comparisons are made concerning different metrics for selecting the most informative REs of each document.
|Title of host publication||Lecture Notes in Computer Science|
|Publication status||Published - 1 Jan 2009|
|Event||EPIA 2009, Portuguese Conference on Artificial Inteligence - |
Duration: 1 Jan 2009 → …
|Conference||EPIA 2009, Portuguese Conference on Artificial Inteligence|
|Period||1/01/09 → …|
Lopes, J. G. P., & Silva, J. F. F. (2009). A Document Descriptor Extractor Based on Relevant Expressions. In Lecture Notes in Computer Science (pp. 646-657). Springer-Verlag. https://doi.org/10.1007/978-3-642-04686-5_53