SMO Wiki

A pretty snapshot of the Wiki brought to you by the Social Media Observatory at HBI

Twitter Tools

Twitter Tool List

This list provides an overview of useful data collection tools that can be used for research on Twitter. If you face problems or issues with one of the applications on the list, feel free to post an Issue. It helps us to maintain this list.

Overview

Most of these Twitter tools connect to official Twitter APIs and therefore need an API key from Twitter. You can retrieve an API key from Twitter easily, just follow the documentation. You are bound to the restrictions given by Twitter. You can read about the rate limits here. Version 2 of the API will be more restrictive (at least it looks like that at time of writing). As an academic, you can apply for access to the new academic track (Twitter Academic API Track Application) though to elevate your access levels to 10 million tweets per month and access to the ‘historic’ archive, i.e., tweets older than 7 days.

Some of the tools are scrapers, which do not use the official APIs. Please be aware that the use of these tools might violate Twitter’s Terms of Service. Despite being public, Twitter data can be very personal. Ensure to inform yourself thoroughly in order to follow data protection laws and ethical guidelines that apply to your research before starting your data collection.

Useful scrapers

Twitter API? Last Tested Language Interfaces Comments
Facepager V1/V2 2/2/2021 - GUI No programming needed
focalevents Academic Not Tested Python CLI depends on PostgreSQL
twacapic Academic 7/4/2021 Python CLI early development
twarc V1/V2/Academic 2/2/2021 Python CLI / Python Module Programming possible
TwitterAPI V1/V2/Academic 2/2/2021 Python Python Module Programming needed
Twint Scraper 2/2/2021 Python Python Module Programming needed
Twitterscraper V1 2/2/2021 Python Python Module Programming needed
tweepy V1/V2 2/2/2021 Python Python Module Programming needed
rtweet V1 Not Tested R R Module Programming needed
twitter-explorer V1/V2/Academic 19/01/2023 Python GUI No Programming needed
cta-tool V2/Academic 13/12/2021 Python Python Module programming needed, collecting and counting conversations, MongoDB
Twitter Downloader Academic 25/05/2022 - GUI no programming needed, access to Tweets only

Tools with extended analysis/refinement capabilities and/or dashboards

DMI-TCAT

The Digital Methods Initiative Twitter Capture and Analysis Toolset [DMI-TCAT] is a set of tools to retrieve and collect tweets from Twitter and to analyze them in various ways. It is written mostly in PHP and runs in a webserver (LAMP) environment…

DMI-TCAT provides robust and reproducible data capture and analysis, and interlinks with existing analytical software. Captured data sets can be refined in different ways (search queries, exclusions, date range, etc.) and the resulting selections of tweets can be analyzed in various ways, mainly by outputting files in standard formats (CSV for tabular files and GEXF for network files).

The big plus of DMI-TCAT is that it is organized around a MySQL database, which can run 24/7 robustly over months to years. However, setting up DMI-TCAT on a server requires some command line skills.

Installation Instructions

SMO-TMAS

SMO-TMAS allows users to pull tweets of specified Twitter handles and tweets containing specified keywords by querying Twitter’s REST API GET search/tweets endpoint and statuses/user_timeline endpoint as well as Twitter’s STREAM API. The collected tweets can be downloaded as .csv file and SMO-TMAS also provides data analysis components that can be used to analyze and visualize the collected data right away…

known issues and limitations:

  • is limited by the Twitter API limitations

Notable Features:

  • Ideal for small datasets
  • Accessible through the web
  • No local installation needed

Installation via: Accessible through a Web Application, no local installation needed.

Documentation and Development

Access

TAGS

TAGS is a free Google Sheet template which lets you setup and run automated collection of search results from Twitter…

known issues and limitations:

  • limited to search queries

Notable Features:

  • easy to use, without command line

Installation via: An Google account is needed to install this sheets

Download and installation instructions

Support forum for beginners and advanced users

Twitter Explorer

Twitter explorer is an openly accessible interface that can collect, transform and visualize Twitter data. It can create a complex network without prior programming skills. twitter explorer consists of mainly three parts which are collector, visualizer, and explorer…

Notable Features:

  • Collector :
  • Collects Twitter using API search.
  • Save the data in jsonl format.

  • Visualizer:
  • Timeline plotting
  • Can generate network depending on retweet and Hashtags networks
  • Can aggregate data based on node degree
  • Plot can be export in .gml/.csv/.gv

  • Explorer:
  • Display network using an html format with interactive dashboard in a browser.
  • Show information about datasets, number of nodes, node size, node scaling, users metadata on click.

Installation via: An installation package is available for Windows, Linux and MacOS

Requires python 3.6 or above .

# replace XXX by release number
 cd ~/Downloads/twitter-explorer-vXXX
pip install -r requirements.txt

After installation we can collect data using streamlit

streamlit run collector.py

Documentation and Usage

VOSONDash

VOSONDash is an interactive R Shiny web application for the visualisation and analysis of social network data. The app has a dashboard layout with sections for visualising and manipulating network graphs, performing text analysis, displaying network metrics and the collection of network data using the vosonSML R package…

known issues and limitations:

  • is limited by the Twitter API limitations

Notable Features:

  • R application that connect to different social media APIs
  • Built-in visual analysis, accessible through an web interface
  • Cross-platform analysis

Installation via: CRAN

Download

Installation and Usage

Twitter Downloader

Twitter Downloader is a tool from Twitter that allows researchers with Academic Access to configure search queries for tweets via a web tool and load them as CSV or JSON files…

known issues and limitations:

  • requires Academic Access
  • Access to Tweets only

Notable Features:

  • GUI: no programming skills needed
  • allows full access to the Twitter archive going back to 2006 (deleted tweets always excluded)

Helpers

Hydrator

Hydrator is an Electron based desktop application for hydrating Twitter ID datasets. Twitter’s Terms of Service do not allow the full JSON for datasets of tweets to be distributed to third parties. However they do allow datasets of tweet IDs to be shared. Hydrator helps you turn these tweet IDs back into JSON and also CSV from the comfort of your desktop…

Notable Features:

  • Program with a Graphic User Interface (GUI) making it easy to use for inexperienced users.

Installation via: An installation package is available for Windows, Linux and MacOS

Downloads

Documentation and Usage

Single Twitter ID Lookup webtools