Abstract
Analysis of a corpus of tens of thousands of blogs –
incorporating close to 300 million words – indicates
significant differences in writing style and content between
male and female bloggers as well as among authors of
different ages. Such differences can be exploited to
determine an unknown author's age and gender on the basis
of a blog's vocabulary
Original language | American English |
---|---|
Title of host publication | AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs |
State | Published - 2006 |