Automatic indexing of scientific papers: presentation and results of the DEFT 2016 text mining challenge

  • Beatrice Daille
  • Sabine Barreaux
  • Adrien Bougouin
  • Florian Boudin
  • Damien Cram
  • Amir Hazem


This paper presents the 2016 edition of the DEFT text mining challenge. This edition addresses the keyword-based indexing of scientific papers, with the aim of simulating the work of a professional indexer. The corpus is composed of French bibliographic records from four domains: linguistics, information science, archaeology and chemistry. Results are evaluated in terms of precision, recall and F-measure, computed on stemmed texts against a manually produced reference indexing.
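
As an illustration of this kind of evaluation, the sketch below scores one record's predicted keywords against its reference keywords. It is a minimal example under stated assumptions, not the official DEFT scorer: the function names `normalize` and `evaluate` are hypothetical, the normalization is a toy stand-in for a real French stemmer, and how scores are aggregated over the corpus is not specified here.

```python
def normalize(keyword):
    """Toy stand-in for stemming (assumption): lowercase each token and
    strip one trailing 's' from tokens longer than three characters."""
    tokens = keyword.lower().split()
    return " ".join(t[:-1] if len(t) > 3 and t.endswith("s") else t for t in tokens)


def evaluate(predicted, reference):
    """Return (precision, recall, F-measure) for one record, comparing
    normalized predicted keywords with normalized reference keywords."""
    pred = {normalize(k) for k in predicted}
    ref = {normalize(k) for k in reference}
    correct = len(pred & ref)
    precision = correct / len(pred) if pred else 0.0
    recall = correct / len(ref) if ref else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Illustrative keywords only; they are not taken from the DEFT corpus.
p, r, f = evaluate(
    ["Corpus", "keywords", "syntax"],
    ["corpus", "keyword", "semantics", "lexicon"],
)
print(f"P={p:.3f} R={r:.3f} F={f:.3f}")
```

Comparing sets of normalized forms means that surface variants such as "keywords" and "keyword" count as the same keyword, which is the purpose of evaluating on stemmed text.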