Assessing the Bias in Communication Networks Sampled from Twitter

Sandra González-Bailón,Ning Wang,A. Rivero,Javier Borge-Holthoefer,Y. Moreno

Published 2012 in arXiv.org

ABSTRACT

We collect and analyse messages exchanged in Twitter using two of the platform's publicly available APIs (the search and stream specifications). We assess the differences between the two samples, and compare the networks of communication reconstructed from them. The empirical context is given by political protests taking place in May 2012: we track online communication around these protests for the period of one month, and reconstruct the network of mentions and re-tweets according to the two samples. We find that the search API over-represents the more central users and does not offer an accurate picture of peripheral activity; we also find that the bias is greater for the network of mentions. We discuss the implications of this bias for the study of diffusion dynamics and collective action in the digital era, and advocate the need for more uniform sampling procedures in the study of online communication.

PUBLICATION RECORD

Publication year
2012
Venue
arXiv.org
Publication date
2012-12-07
Fields of study
Physics, Computer Science, Political Science
Identifiers
DOI 10.2139/ssrn.2185134 arXiv 1212.1684
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

What is Twitter
2013cited by this paper
Digitally Enabled Social Change: Activism in the Internet Age
2012cited by this paper
Twitter, MySpace, Digg: Unsupervised Sentiment Analysis in Social Media
2012cited by this paper
Broadcasters and Hidden Influentials in Online Protest Diffusion
2012cited by this paper
The Consequences of the Internet for Politics
2012cited by this paper
The World of Connections and Information Flow in Twitter
2012cited by this paper
The Social World of Twitter: Topics, Geography, and Emotions
2012cited by this paper
Geography of Twitter networks
2012cited by this paper
Who says what to whom on twitter
2011cited by this paper
Information credibility on twitter
2011cited by this paper
Social Features of Online Networks: The Strength of Intermediary Ties in Online Social Media
2011influential reference
Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter
2011cited by this paper
Structural and Dynamical Patterns on Online Social Networks: The Spanish May 15th Movement as a Case Study
2011cited by this paper
Dynamical classes of collective attention in twitter
2011cited by this paper
The Dynamics of Protest Recruitment through an Online Network
2011cited by this paper
Modular networks of word correlations on Twitter
2011cited by this paper
Political Polarization on Twitter
2011influential reference
Everyone's an influencer: quantifying influence on twitter
2011cited by this paper
Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter
2011cited by this paper
Modeling Users' Activity on Twitter Networks: Validation of Dunbar's Number
2011cited by this paper
The Diversity-Bandwidth Trade-off1
2011cited by this paper
#iranElection: quantifying online activism
2010cited by this paper
Measuring User Influence in Twitter: The Million Follower Fallacy
2010influential reference
Twitter mood predicts the stock market
2010cited by this paper
Dynamic Debates: An Analysis of Group Polarization Over Time on Twitter
2010cited by this paper
What is Twitter, a social network or a news media?
2010influential reference
Change and External Events in Computer-Mediated Citation Networks: English Language Weblogs and the 2004 U.S. Electoral Cycle*
2009cited by this paper
Twitter power: Tweets as electronic word of mouth
2009cited by this paper
Beyond Microblogging: Conversation and Collaboration via Twitter
2009cited by this paper
Social Networks that matter: Twitter under the Microscope
2008cited by this paper
Why we twitter: understanding microblogging usage and communities
2007cited by this paper
Reconceptualizing Collective Action in the Contemporary Media Environment
2005cited by this paper
--- ! ! DRAFT VERSION ! !---- ! !
2004influential reference
Which Public Goods are Endangered?: How Evolving Communication Technologies Affect The Logic of Collective Action
2003cited by this paper
Social Movements and Networks: Relational Approaches to Collective Action
2003cited by this paper
Networks and Social Movements: A Research Programme
2003cited by this paper
Social Movements and Networks
2003cited by this paper
Models of core/periphery structures
2000cited by this paper
Specifying the Relationship Between Social Ties and Activism
1993cited by this paper
The critical mass in collective action
1993cited by this paper
Power and Centrality: A Family of Measures
1987cited by this paper
Recruitment to High-Risk Activism: The Case of Freedom Summer
1986cited by this paper
Network structure and minimum degree
1983cited by this paper
The Logic of Collective Action
1965cited by this paper

CITED BY

Mapping the evolving networks of the #StopAsianHate movement on Twitter: the role of serial participants in digital activism
2025cites this paper
Spatial Vitality Detection and Evaluation in Zhengzhou’s Main Urban Area
2024cites this paper
Building a nationally representative sample of teachers’ online and offline: the Public Instructional Network of School Resources
2023cites this paper
Sustainable News and Their Impact on Company’s Market Capitalization Does Sustainable News About a Company Have a Positive Relationship with Its Stock Price?
2022cites this paper
The Momo Challenge: measuring the extent to which YouTube portrays harmful and helpful depictions of a suicide game
2021influential citation
Crawling Twitter data through API: A technical/legal perspective
2021cites this paper
Data-driven educational algorithms pedagogical framing
2020cites this paper
Twitter Users’ Views on Mental Health Crisis Resolution Team Care Compared With Stakeholder Interviews and Focus Groups: Qualitative Analysis
2020cites this paper
Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries
2019cites this paper
Twitter as Data
2018cites this paper
Twitter Influencers in the 2016 US Congressional Races
2018cites this paper
Nonprobability Sampling and Twitter
2018cites this paper
Social Networks as Real-time Data Distribution Platforms for Smart Cities
2018cites this paper
Studying Networked Communication in the Middle East : Social Disrupter and Social Observatory
2018cites this paper
Political rumoring on Twitter during the 2012 US presidential election: Rumor diffusion and correction
2017cites this paper
Community Detection in Political Discussions on Twitter
2017cites this paper
Cognitive big data: survey and review on big data research and its implications. What is really "new" in big data?
2017cites this paper
Public crowdsensing of heat waves by social media data
2017cites this paper
Probing the Limits of Social Data - Biases, Methods, and Domain Knowledge
2016influential citation
Improving the Veracity of Open and Real-Time Urban Data
2016cites this paper
Dialect in digitally mediated written interaction: a survey of the geohistorical distribution of the ditransitive in British English using Twitter
2016cites this paper
A Comprehensive Survey on Big-Data Research and its Implications - What is Really 'New' in Big Data? - IT's Cognitive Big Data!
2016cites this paper
Beyond data collection: Objectives and methods of research using VGI and geo-social media for disaster management
2016cites this paper
@Spain Is Different: Co-branding Strategies Between Spanish National and Regional DMOs on Twitter
2015cites this paper
What to Expect When the Unexpected Happens: Social Media Communications Across Crises
2015cites this paper
Understanding the Political Representativeness of Twitter Users
2015cites this paper
Characterizing interactions in online social networks during exceptional events
2015cites this paper
Reliability of Data Collection Methods in Social Media Research
2015cites this paper
In the name of Development: Power, profit and the datafication of the global South
2015cites this paper
Is bigger better? The emergence of big data as a tool for international development policy
2015cites this paper
Learning Human Dynamics with Big Data from Online Social Networks
2015cites this paper
Using APIs for Data Collection on Social Media
2014cites this paper
Two Essays on Information Asymmetry
2014cites this paper
Methodological challenges of studying social media from the perspective of information manipulation
2014cites this paper
Stuttgart’s Black Thursday on Twitter : Mapping Political Protests with Social Media Data
2014cites this paper
Content and Network Dynamics Behind Egyptian Political Polarization on Twitter
2014cites this paper
Big Data, Big Questions| Working Within a Black Box: Transparency in the Collection and Production of Big Twitter Data
2014cites this paper
Two 1%s Don't Make a Whole: Comparing Simultaneous Samples from Twitter's Streaming API
2014cites this paper
Using APIs for Data Collection on Social Media
2014cites this paper
Using Social Media Content to Inform Agent-based Models for Humanitarian Crisis Response
2014cites this paper
The Arab Spring on Twitter: Language Communities in #egypt and #libya
2014cites this paper
Exploiting Twitter for Border Security-Related Intelligence Gathering
2013cites this paper
Contraction of Online Response to Major Events
2013cites this paper
The danger of a big data episteme and the need to evolve geographic information systems
2013cites this paper
Amostragem e Caraterização de Coleções de Dados do Twitter
2013cites this paper
Construção de Amostras de Dados do Twitter
2013cites this paper
Studying Physical Activity Using Social Media: An Analysis of the Added Value of RunKeeper Tweets
2013cites this paper
Cascading behaviour in complex socio-technical networks
2013cites this paper
The Bridges and Brokers of Global Campaigns in the context of Social Media
2013cites this paper
Social Science in the Era of Big Data
2013cites this paper
Using Twitter to Explore the Effect of Social Media Buzz on Sales Bachelor Thesis
2013cites this paper
Predicting American Idol with Twitter Sentiment
2013cites this paper
The Danger of a Big Data Episteme and the Need to Evolve GIS
2013cites this paper
The utility of social and topical factors in anticipating repliers in Twitter conversations
2013cites this paper