Ad verba per numeros

Wednesday, June 4, 2014, 01:46 PM
A few days ago Frederic Guerrero-Solé (@guerrerosolef) pointed me to a paper he had recently published. The abstract is very explicit so I'll provide it verbatim:
In recent years, social networking sites (SNS) have become enormously popular, in particular microblogging sites such as Twitter. Twitter is nowadays one of the most used SNS for social, economic and political communication. Drawing on well-known characteristics of social networks and human behavior, i.e. the homophilic behavior of individuals, the power law distribution of influence and retweeting, and the nature of endorsement and the reduction of noise of retweeting, this article proposes a two-step method to first uncover the structure of the network of the top influential Twitter users in a political discussion and second based on the resulting structure of political clusters, predict the voters’ perception of the closeness between parties, the proportion of undecided voters between two given parties and the support for pacts between parties. The method analyzes the overlaps of communities of retweeters of the most influent users in a political conversation, and applies it to the Catalan elections in 2012. Comparing our results with those of the Spanish Center for Sociological Research, we show that the overlaps between parties' communities of retweeters are a good predictor of voters' indecision and preferences for post-electoral parliamentary support and coalitions.

In short, Frederic et al. have analyzed political discussion by exploiting retweets in a quite novel way; instead of studying the RTing cascades they have paid attention to the communities that co-RTin induces. Furthermore, they have analyzed a multiparty scenario which, IMHO, is much more interesting than common analyses of Democrats vs Republicans that pervade the literature.

Needless to say, the paper has limitations as any other political research conducted on Twitter data but I hope to see this approach replicated in other multiparty scenarios in the near future.

As a bonus track I provide a number of additional references that can be of interest for anyone working in the area, and/or trying to extend the work by Guerrero-Solé et al.

  • First, a couple of papers providing support to the idea that tweets convey (in a latent and subtle way) voting intention: DiGrazia et al., 2013; Jensen & Anstead, 2013).
  • Then, a couple of papers discussing the worrying bias caused by relying on samples of tweets: González-Bailón et al., 2012; Morstatter et al., 2013.
  • A nice discussion about the meaning of edited retweets regarding political discussion: Mustafaraj & Metaxas, 2011.
  • And, finally, some recent literature about community detection: Fortunato, 2010; Xie et al., 2013; Yang & Leskovec, 2013.

Back Next