Automatic Extraction of Explicit and Implicit Keywords to Build Document Descriptors

Research output: Chapter in Book/Report/Conference proceedingChapter

3 Citations (Scopus)

Abstract

Keywords are single and multiword terms that describe the semantic content of documents. They are useful in many applications, such as document searching and indexing, or to be read by humans. Keywords can be explicit, by occurring in documents, or implicit, since, although not explicitly written in documents, they are semantically re- lated to their contents. This paper presents a statistical approach to build document descriptors with explicit and implicit keywords automatically extracted from the documents. Our approach is language-independent and we show comparative results for three different European languages.
Original languageUnknown
Title of host publicationProgress in Artificial Intelligence
EditorsLuís Correia, Luís Paulo Reis, José Cascalho
Place of PublicationHeidelberg
PublisherSpringer Berlin Heidelberg
Pages492-503
ISBN (Print)978-3-642-40668-3 / 978-3-642-40669-0
Publication statusPublished - 1 Jan 2013

Publication series

NameLecture Notes in Computer Science
PublisherSpringer Berlin Heidelberg
Number8154
ISSN (Print)0302-9743

Cite this