Every second, the thoughts and feelings of millions of people across the world are recorded in the form of 140-character tweets using Twitter. However, despite the enormous potential presented by this remarkable data source, we still do not have an understanding of the Twitter population itself: Who are the Twitter users? How representative of the overall population are they? In this paper, we take the first steps towards answering these questions by analyzing data on a set of Twitter users representing over 1% of the U.S. population. We develop techniques that allow us to compare the Twitter population to the U.S. population along three axes (geography, gender, and race/ethnicity), and find that the Twitter population is a highly non-uniform sample of the population.
|Title of host publication||Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media|
|Publication status||Published - 2011|
|Event||5th International AAAI Conference on Weblogs and Social Media (ICWSM 2011) - Barcelona, Spain|
Duration: 17 Jul 2011 → 21 Jul 2011
|Conference||5th International AAAI Conference on Weblogs and Social Media (ICWSM 2011)|
|Period||17/07/2011 → 21/07/2011|