Published

2007-05-01

AN ARTIFICIAL IMMUNE SYSTEM BASED ON INFORMATION THEORY FOR KEYWORD EXTRACTION FROM TEXT DOCUMENTS

Keywords:

Keyword Extraction, Artificial Immune Systems, Information Theory

Authors

  • ANDRÉS ROMERO, Ing., Laboratorio de Investigación en Sistemas Inteligentes, Universidad Nacional de Colombia, Sede Bogotá, Bogotá, Colombia
  • FERNANDO NIÑO, PhD., Laboratorio de Investigación en Sistemas Inteligentes, Universidad Nacional de Colombia, Sede Bogotá, Bogotá, Colombia
This paper presents a model for keyword extraction that extends the basic concepts commonly used in this task in order to provide a formal basis for determining the importance of keywords to documents. The proposed model combines an artificial immune system with a mathematical foundation based on information theory. Its advantage is that it requires no domain knowledge, no stopword list, and no prior information about the content of the documents. The final result is a set of keywords for each category in the corpus used.