TY - JOUR
T1 - A proposed methodology for studying the historical trajectory of words’ meaning through Tsallis entropy
AU - Neuman, Yair
AU - Cohen, Yochai
AU - Israeli, Navot
AU - Tamir, Boaz
N1 - Publisher Copyright: © 2017 Elsevier B.V.
PY - 2018/2/15
Y1 - 2018/2/15
AB - The availability of historical textual corpora has led to the study of words’ frequency along the historical timeline, as representing the public’s focus of attention over time. However, the study of the dynamics of words’ meaning is still in its infancy. In this paper, we propose a methodology for studying the historical trajectory of words’ meaning through Tsallis entropy. First, we present the idea that the meaning of a word may be studied through the entropy of its embedding. Using two historical case studies, we show that this entropy measure is correlated with the intensity with which a word is used. More importantly, we show that using Tsallis entropy with a superadditive entropy index may provide a better estimation of a word’s frequency of use than using Shannon entropy. We explain this finding as resulting from an increasing redundancy between the words that comprise the semantic field of the target word, and we develop a new measure of redundancy between words. Using this measure, which relies on the Tsallis version of the Kullback–Leibler divergence, we show that the evolving meaning of a word involves the dynamics of increasing redundancy between components of its semantic field. The proposed methodology may enrich the toolkit of researchers who study the dynamics of word senses.
KW - Historical corpora
KW - Meaning
KW - Natural language
KW - Tsallis entropy
KW - Words’ dynamics
UR - http://www.scopus.com/inward/record.url?scp=85035239038&partnerID=8YFLogxK
U2 - 10.1016/j.physa.2017.11.011
DO - 10.1016/j.physa.2017.11.011
M3 - Article
AN - SCOPUS:85035239038
SN - 0378-4371
VL - 492
SP - 804
EP - 813
JO - Physica A: Statistical Mechanics and its Applications
JF - Physica A: Statistical Mechanics and its Applications
ER -