Cross-partition clustering: Revealing corresponding themes across related datasets

Zvika Marx, Ido Dagan, Eli Shamir

Research output: Contribution to journalArticlepeer-review

Abstract

This article studies the task of discovering correspondences across related domains based on real-world data collections. We address this task through a designated extension of distributional data-clustering methods. The method is empirically demonstrated on synthetic data as well as on texts addressing different religions, where the goal is to identify commonalities shared by all religions. This article generalises and demonstrates the empirical improvement relative to our previous studies on this subject, as well as to other comparable methods.

Original languageEnglish
Pages (from-to)153-180
Number of pages28
JournalJournal of Experimental and Theoretical Artificial Intelligence
Volume23
Issue number2
DOIs
StatePublished - Jun 2011

Keywords

  • analogy
  • data clustering
  • information theory
  • natural language processing
  • structure mapping
  • text mining

Fingerprint

Dive into the research topics of 'Cross-partition clustering: Revealing corresponding themes across related datasets'. Together they form a unique fingerprint.

Cite this