Abstract
Analysis of a corpus of tens of thousands of blogs, incorporating close to 300 million words, indicates significant differences in writing style and content between male and female bloggers as well as among authors of different ages. Such differences can be exploited to determine an unknown author's age and gender on the basis of a blog's vocabulary.
Original language | English |
---|---|
Title of host publication | Computational Approaches to Analyzing Weblogs - Papers from the AAAI Spring Symposium, Technical Report |
Pages | 191-197 |
Number of pages | 7 |
State | Published - 2006 |
Event | 2006 AAAI Spring Symposium - Stanford, CA, United States; Duration: 27 Mar 2006 → 29 Mar 2006 |
Publication series
Name | AAAI Spring Symposium - Technical Report |
---|---|
Volume | SS-06-03 |
Conference
Conference | 2006 AAAI Spring Symposium |
---|---|
Country/Territory | United States |
City | Stanford, CA |
Period | 27/03/06 → 29/03/06 |