TY - JOUR
T1 - Automatic extraction and learning of keyphrases from scientific articles
AU - HaCohen-Kerner, Yaakov
AU - Gross, Zuriel
AU - Masa, Asaf
PY - 2005
Y1 - 2005
AB - Many academic journals and conferences require that each article include a list of keyphrases. These keyphrases should provide general information about the contents and the topics of the article. Keyphrases may save precious time for tasks such as filtering, summarization, and categorization. In this paper, we investigate automatic extraction and learning of keyphrases from scientific articles written in English. Firstly, we introduce various baseline extraction methods. Some of them, formalized by us, are very successful for academic papers. Then, we integrate these methods using different machine learning methods. The best results have been achieved by J48, an improved variant of C4.5. These results are significantly better than those achieved by previous extraction systems, regarded as the state of the art.
UR - http://www.scopus.com/inward/record.url?scp=24344441635&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-30586-6_74
DO - 10.1007/978-3-540-30586-6_74
M3 - Conference article
AN - SCOPUS:24344441635
SN - 0302-9743
VL - 3406
SP - 657
EP - 669
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
T2 - 6th International Conference, CICLing 2005
Y2 - 13 February 2005 through 19 February 2005
ER -