the dataset description. 1. consists of pairs of files, .txt with full text, .key with keyphrases each on a new line. 2. there is the file "!authors.dat" inside the archive, it contains all authors for all papers. 3. how to cite? Use the bibtex: @TECHREPORT{key:dataset2009krapivin-autayeu-marchese, AUTHOR = {Mikalai Krapivin and Aliaksandr Autayeu and Maurizio Marchese}, TITLE = {Large Dataset for Keyphrases Extraction}, INSTITUTION = {DISI, Trento, Italy}, YEAR = {2008}, month = {May}, number = {DISI-09-055}, note = {http://eprints.biblio.unitn.it/archive/00001671/01/disi09055-krapivin-autayeu-marchese.pdf}, url = {\url{http://eprints.biblio.unitn.it/archive/00001671/01/disi09055-krapivin-autayeu-marchese.pdf}} }