The Spectral Underpinning of word2vec

Ariel Jaffe, Yuval Kluger, Ofir Lindenbaum, Jonathan Patsenker, Erez Peterfreund, Stefan Steinerberger

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Word2vec, introduced by Mikolov et al., is a word embedding method that is widely used in natural language processing. Despite its success and frequent use, a strong theoretical justification is still lacking. The main contribution of our paper is a rigorous analysis of the highly nonlinear functional of word2vec. Our results suggest that word2vec may be primarily driven by an underlying spectral method. This insight may open the door to obtaining provable guarantees for word2vec. We support these findings with numerical simulations. One fascinating open question is whether the nonlinear properties of word2vec that are not captured by the spectral method are beneficial and, if so, by what mechanism.

Original language: English
Article number: 593406
Journal: Frontiers in Applied Mathematics and Statistics
Volume: 6
State: Published - 3 Dec 2020
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2020 Jaffe, Kluger, Lindenbaum, Patsenker, Peterfreund and Steinerberger.

Funding

YK is supported in part by NIH grants UM1DA051410, R01GM131642, P50CA121974 and R61DA047037. SS was funded by NSF-DMS 1763179 and the Alfred P. Sloan Foundation. EP has been partially supported by the Blavatnik Interdisciplinary Research Center (ICRC), the Federmann Research Center (Hebrew University), Israeli Science Foundation research grant no. 1523/16, and the DARPA PAI program (Agreement No. HR00111890032, Dr. T. Senator). The authors thank James Garritano and the anonymous reviewers for their helpful feedback.

Funders and funder numbers
National Institutes of Health: UM1DA051410, R01GM131642, P50CA121974, R61DA047037
NSF-DMS: 1763179
Alfred P. Sloan Foundation
Blavatnik Interdisciplinary Research Center (ICRC)
Federmann Research Center (Hebrew University)
Israeli Science Foundation: 1523/16
Defense Advanced Research Projects Agency: HR00111890032

    Keywords

    • dimensionality reduction
    • nonlinear functional
    • skip-gram model
    • spectral method
    • word embedding
    • word2vec
