Measuring differentiability: Unmasking pseudonymous authors

Moshe Koppel, Jonathan Schier, Elisheva Bonchek-Dokow

Research output: Contribution to journal › Article › peer-review

189 Scopus citations

Abstract

In the authorship verification problem, we are given examples of the writing of a single author and are asked to determine if given long texts were or were not written by this author. We present a new learning-based method for adducing the "depth of difference" between two example sets and offer evidence that this method solves the authorship verification problem with very high accuracy. The underlying idea is to test the rate of degradation of the accuracy of learned models as the best features are iteratively dropped from the learning process.
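The idea in the last sentence of the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the abstract does not name the learner, the feature set, or how many features are dropped per round, so everything below is an assumption made for illustration. The classifier is a simple centroid-difference linear model standing in for whatever learner the paper uses, the "best" features are taken to be those with the largest absolute weight, the data are synthetic feature vectors, and all function names (`make_chunks`, `fit`, `cv_accuracy`, `unmasking_curve`) are hypothetical.

```python
def make_chunks(n_per_class, n_features, differing):
    """Toy data: class 1 is shifted by 1.0 on the `differing` features only."""
    xs, ys = [], []
    for label in (0, 1):
        for i in range(n_per_class):
            row = []
            for j in range(n_features):
                base = ((i * 7 + j * 3) % 5) / 25.0  # small deterministic variation
                row.append(base + (1.0 if label == 1 and j in differing else 0.0))
            xs.append(row)
            ys.append(label)
    return xs, ys


def fit(xs, ys, active):
    """Assumed stand-in learner: weight = difference of class means per feature."""
    w, mid = {}, {}
    for j in active:
        pos = [x[j] for x, y in zip(xs, ys) if y == 1]
        neg = [x[j] for x, y in zip(xs, ys) if y == 0]
        mean_pos, mean_neg = sum(pos) / len(pos), sum(neg) / len(neg)
        w[j] = mean_pos - mean_neg
        mid[j] = (mean_pos + mean_neg) / 2
    return w, mid


def predict(model, x):
    w, mid = model
    return 1 if sum(w[j] * (x[j] - mid[j]) for j in w) > 0 else 0


def cv_accuracy(xs, ys, active, folds=5):
    """Cross-validated accuracy using only the currently active features."""
    n = len(xs)
    correct = 0
    for f in range(folds):
        test_idx = set(range(f, n, folds))
        train_x = [xs[i] for i in range(n) if i not in test_idx]
        train_y = [ys[i] for i in range(n) if i not in test_idx]
        model = fit(train_x, train_y, active)
        correct += sum(predict(model, xs[i]) == ys[i] for i in test_idx)
    return correct / n


def unmasking_curve(xs, ys, rounds=3, drop=2):
    """Record accuracy, drop the strongest features, and repeat."""
    active = set(range(len(xs[0])))
    curve = []
    for _ in range(rounds):
        if not active:
            break
        curve.append(cv_accuracy(xs, ys, active))
        weights, _ = fit(xs, ys, active)
        ranked = sorted(sorted(active), key=lambda j: abs(weights[j]), reverse=True)
        active -= set(ranked[:drop])  # remove the current "best" features
    return curve


# Deep difference: the two example sets differ on every feature.
xs, ys = make_chunks(10, 8, differing=set(range(8)))
curve_different = unmasking_curve(xs, ys)

# Shallow difference: the two example sets differ on only two features.
xs, ys = make_chunks(10, 8, differing={0, 1})
curve_same = unmasking_curve(xs, ys)

print("deep difference:   ", curve_different)
print("shallow difference:", curve_same)
```

In this toy setting the first curve stays flat while the second falls to chance as soon as the two distinguishing features are removed. That contrast in the rate of degradation is the kind of signal the abstract describes; how the resulting curves are turned into a same-author or different-author decision is not covered by this sketch.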

Original language: English
Pages (from-to): 1261-1276
Number of pages: 16
Journal: Journal of Machine Learning Research
Volume: 8
State: Published - Jun 2007

Bibliographical note

Funding Information:
Work supported by the EC (FEDER/FSE) and the Spanish MEC/MICINN under the MIPRCV “Consolider Ingenio 2010” program (CSD2007-00018) and iTrans2 (TIN2009-14511) projects. Also supported by the Spanish MITyC under the erudito.com (TSI-020110-2009-439) project and by the Generalitat Valenciana under grant Prometeo/2009/014 and GV/2010/067, and by the “Vicerrectorado de Investigación de la UPV” under grant 20091027.

Funding

Funders                                                          Funder number
MITyC                                                            TSI-020110-2009-439
Vicerrectorado de Investigación de la UPV                        20091027
Faculty of Science and Engineering, University of Manchester
European Commission
Federación Española de Enfermedades Raras
Ministerio de Economía y Competitividad
Generalitat Valenciana                                           Prometeo/2009/014, GV/2010/067
Ministerio de Ciencia e Innovación                               CSD2007-00018, TIN2009-14511
European Regional Development Fund

Keywords

• Authorship attribution
• One-class learning
• Unmasking
