Basic word completion and prediction for Hebrew

Yaakov Hacohen-Kerner, Izek Greenfield

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

This research aims to improve keystroke savings for completion and prediction of Hebrew words. The task is important to augmentative and alternative communication systems as well as to search engines, short message services, and mobile phones. The proposed model is composed of Hebrew corpora containing 177M words, a morphological analyzer, various n-gram Hebrew language models, and other tools. The achieved keystroke savings rate is higher than those reported for a previous Hebrew word prediction system and for previous word prediction systems in other languages. There are two main findings: the larger the corpus on which the language model is trained, the better the predictions; and a morphological analyzer helps only when the language model is based on a single corpus.
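As a rough illustration of the metric named in the abstract, the sketch below simulates word completion over a toy frequency model and computes keystroke savings as the percentage of keystrokes avoided relative to typing every character. It is not the authors' system: the unigram model, the one-key cost of accepting a suggestion, the suggestion-list size `k`, the function names, and the transliterated sample words are all assumptions made for this example, and spaces are ignored.

```python
from collections import Counter


def build_unigram_model(corpus_words):
    """Count word frequencies; a toy stand-in for a real n-gram language model."""
    return Counter(corpus_words)


def complete(model, prefix, k=3):
    """Return up to k most frequent words that start with the given prefix."""
    candidates = [(w, c) for w, c in model.items() if w.startswith(prefix)]
    # Sort by descending frequency, then alphabetically so ties are deterministic.
    candidates.sort(key=lambda wc: (-wc[1], wc[0]))
    return [w for w, _ in candidates[:k]]


def keystrokes_for_word(model, word, k=3):
    """Keys needed to enter a word, assuming one extra key accepts a suggestion."""
    for typed in range(1, len(word)):
        if word in complete(model, word[:typed], k):
            return typed + 1  # letters typed so far + one selection key
    return len(word)  # no helpful suggestion: the whole word is typed


def keystroke_savings(model, text_words, k=3):
    """Percentage of keystrokes saved compared with typing every character."""
    total_chars = sum(len(w) for w in text_words)
    if total_chars == 0:
        return 0.0
    pressed = sum(keystrokes_for_word(model, w, k) for w in text_words)
    return 100.0 * (total_chars - pressed) / total_chars


# Toy, transliterated training data (hypothetical; not from the paper's corpora).
model = build_unigram_model(["shalom", "shalom", "shalom", "shabbat", "sefer"])
print(keystroke_savings(model, ["shalom", "shabbat"], k=1))
```

In this toy setup a larger suggestion list or a model trained on more text changes which prefix first surfaces the intended word, which is the quantity the keystroke savings rate summarizes.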

Original language: English
Title of host publication: String Processing and Information Retrieval - 19th International Symposium, SPIRE 2012, Proceedings
Publisher: Springer Verlag
Pages: 237-244
Number of pages: 8
ISBN (Print): 9783642341083
State: Published - 2012
Externally published: Yes
Event: 19th International Symposium on String Processing and Information Retrieval, SPIRE 2012 - Cartagena de Indias, Colombia
Duration: 21 Oct 2012 to 25 Oct 2012

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7608 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 19th International Symposium on String Processing and Information Retrieval, SPIRE 2012
Country/Territory: Colombia
City: Cartagena de Indias
Period: 21/10/12 to 25/10/12

Keywords

  • Augmentative and alternative communication
  • Corpora
  • Hebrew
  • Keystroke savings
  • Language models
  • Word completion
  • Word prediction
