Determining if two documents are written by the same author

Moshe Koppel, Yaron Winter

Research output: Contribution to journalArticlepeer-review

171 Scopus citations

Abstract

Almost any conceivable authorship attribution problem can be reduced to one fundamental problem: whether a pair of (possibly short) documents were written by the same author. In this article, we offer an (almost) unsupervised method for solving this problem with surprisingly high accuracy. The main idea is to use repeated feature subsampling methods to determine if one document of the pair allows us to select the other from among a background set of impostors in a sufficiently robust manner.

Original languageEnglish
Pages (from-to)178-187
Number of pages10
JournalJournal of the American Society for Information Science and Technology
Volume65
Issue number1
DOIs
StatePublished - Jan 2014

Fingerprint

Dive into the research topics of 'Determining if two documents are written by the same author'. Together they form a unique fingerprint.

Cite this