Toward semantic image similarity from crowdsourced clustering

Yanir Kleiman, George Goldberg, Yael Amsterdamer, Daniel Cohen-Or

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

Determining the similarity between images is a fundamental step in many applications, such as image categorization, image labeling and image retrieval. Automatic methods for similarity estimation often fall short when semantic context is required for the task, raising the need for human judgment. Such judgments can be collected via crowdsourcing techniques, based on tasks posed to web users. However, to allow the estimation of image similarities in reasonable time and cost, the generation of tasks to the crowd must be done in a careful manner. We observe that distances within local neighborhoods provide valuable information that allows a quick and accurate construction of the global similarity metric. This key observation leads to a solution based on clustering tasks, comparing relatively similar images. In each query, crowd members cluster a small set of images into bins. The results yield many relative similarities between images, which are used to construct a global image similarity metric. This metric is progressively refined, and serves to generate finer, more local queries in subsequent iterations. We demonstrate the effectiveness of our method on datasets where ground truth is available, and on a collection of images where semantic similarities cannot be quantified. In particular, we show that our method outperforms alternative baseline approaches, and prove the usefulness of clustering queries, and of our progressive refinement process.

Original languageEnglish
Pages (from-to)1045-1055
Number of pages11
JournalVisual Computer
Volume32
Issue number6-8
DOIs
StatePublished - 1 Jun 2016

Bibliographical note

Publisher Copyright:
© 2016, Springer-Verlag Berlin Heidelberg.

Keywords

  • Crowdsourcing
  • Image distance metric
  • Image similarity

Fingerprint

Dive into the research topics of 'Toward semantic image similarity from crowdsourced clustering'. Together they form a unique fingerprint.

Cite this