Sharing Twitter datasets for research

(as of Feb 22, 2018)

In short, a research institution can share an unlimited number of Tweet IDs for non-commercial research; this should be fine according to the Twitter Developer Agreement and Policy.


Storing

Storing data downloaded through the API is allowed. Geographic data, though, must be kept together with the content it is associated with.

Sharing

  • you can share up to 50,000 public tweets/users per day and per user of the service
  • you can share (for non-commercial research) an unlimited number of Tweet IDs if you are a research institution
  • developers must keep content in sync with the content on Twitter (if a user deletes a Tweet or protects their account, for example)
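As a rough illustration of the last two points, the sketch below reduces a stored collection to a list of Tweet IDs and then drops IDs that are no longer available. It assumes the tweets are stored one JSON object per line with an `id_str` (or `id`) field, as in API v1.1 payloads. The function names and the list of unavailable IDs are hypothetical; how you find out that a tweet was deleted or protected is not covered here.

```python
import json


def extract_tweet_ids(json_lines):
    """Return the Tweet IDs found in an iterable of JSON-encoded tweets.

    Assumes one tweet object per line. IDs are returned as strings,
    in first-seen order, without duplicates.
    """
    seen = set()
    ids = []
    for line in json_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        tweet = json.loads(line)
        # Prefer the string form of the ID; fall back to the numeric one.
        tweet_id = tweet.get("id_str") or str(tweet["id"])
        if tweet_id not in seen:
            seen.add(tweet_id)
            ids.append(tweet_id)
    return ids


def drop_unavailable(ids, unavailable_ids):
    """Remove IDs of tweets known to be deleted or protected (hypothetical input)."""
    unavailable = set(unavailable_ids)
    return [tweet_id for tweet_id in ids if tweet_id not in unavailable]
```

The shared file would then contain only the IDs (one per line, for example), and recipients would need to re-download the content through the API themselves.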

The license should be CC-BY-NC (https://creativecommons.org/licenses/by-nc/4.0/).

Sidenote: there are explicit limitations for embargoed countries, but it is not clear how these should be enforced when sharing.

[https://twittercommunity.com/t/policy-update-clarification-research-use-cases/87566]

[http://mith.umd.edu/miths-ed-summers-discusses-ferguson-twitter-archive/]

Chiara Boldrini
Senior researcher

My research interests include social computing, human-centered (causal) decentralized AI, and Internet of People