TY - JOUR
T1 - HAADS
T2 - A Hebrew Aramaic abbreviation disambiguation system
AU - HaCohen-Kerner, Yaakov
AU - Kass, Ariel
AU - Peretz, Ariel
PY - 2010/9
Y1 - 2010/9
N2 - In many languages abbreviations are very common and are widely used in both written and spoken language. However, they are not always explicitly defined and in many cases they are ambiguous. This research presents a process that attempts to solve the problem of abbreviation ambiguity using modern machine learning (ML) techniques. Various baseline features are explored, including context-related methods and statistical methods. The application domain is Jewish Law documents written in Hebrew and Aramaic, which are known to be rich in ambiguous abbreviations. Two research approaches were implemented and tested: general and individual. Our system applied four common ML methods to find a successful integration of the various baseline features. The best result was achieved by the SVM ML method in the individual research, with 98-07% accuracy.
AB - In many languages abbreviations are very common and are widely used in both written and spoken language. However, they are not always explicitly defined and in many cases they are ambiguous. This research presents a process that attempts to solve the problem of abbreviation ambiguity using modern machine learning (ML) techniques. Various baseline features are explored, including context-related methods and statistical methods. The application domain is Jewish Law documents written in Hebrew and Aramaic, which are known to be rich in ambiguous abbreviations. Two research approaches were implemented and tested: general and individual. Our system applied four common ML methods to find a successful integration of the various baseline features. The best result was achieved by the SVM ML method in the individual research, with 98-07% accuracy.
UR - http://www.scopus.com/inward/record.url?scp=77956373215&partnerID=8YFLogxK
U2 - 10.1002/asi.21367
DO - 10.1002/asi.21367
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:77956373215
SN - 1532-2882
VL - 61
SP - 1923
EP - 1932
JO - Journal of the American Society for Information Science and Technology
JF - Journal of the American Society for Information Science and Technology
IS - 9
ER -