Overview of uni-modal and multi-modal representations for classification tasks

Aryeh Wiesen, Yaakov HaCohen-Kerner

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Classification is one of the most fundamental tasks in data mining and machine learning. It is being applied in an increasing number of fields, e.g. filtering, identification, information retrieval, information extraction, and similarity detection. A basic and necessary condition for the success of a classification task is the proper representation of the information it wishes to classify. Classification is needed in domains that are based on uni-modal representations such as text, images, audio, and speech, as well as in domains that are based on multi-modal representations. This paper aims to provide a short review on the developing area of multi-modal representations for classification with emphasis on state-of-the-art systems in this area. Firstly, fundamentals of uni-modal representations are given. Secondly, an overview of multi-modal representations is given. Thirdly, various related systems using multi-modal representations and the datasets used by them are briefly summarized with a comparative summary of these systems.

Original language: English
Title of host publication: Natural Language Processing and Information Systems - 23rd International Conference on Applications of Natural Language to Information Systems, NLDB 2018, Proceedings
Editors: Farid Meziane, Max Silberztein, Faten Atigui, Elena Kornyshova, Elisabeth Metais
Publisher: Springer Verlag
Pages: 397-404
Number of pages: 8
ISBN (Print): 9783319919461
State: Published - 2018
Externally published: Yes
Event: 23rd International Conference on Natural Language and Information Systems, NLDB 2018 - Paris, France
Duration: 13 Jun 2018 to 15 Jun 2018

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 10859 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 23rd International Conference on Natural Language and Information Systems, NLDB 2018
Country/Territory: France
City: Paris
Period: 13/06/18 to 15/06/18

Bibliographical note

Publisher Copyright:
© 2018, Springer International Publishing AG, part of Springer Nature.

Keywords

  • Classification
  • Multi-modal representation
  • Textual features
  • Uni-modal representation
  • Visual features
