academictwitteR

Repo containing code to loop through usernames/hashtags and collect tweets from Full Archive v2 API endpoint for the Academic Research Product Track. Uses new pagination_token query params. Repo now contains skeleton for developing package to contain all functions; for now, main functions are located in folder "R/".

Installation

You can install the development package with:

devtools::install_github("cjbarrie/academictwitteR")

NOTE: the name of the package has been changed from twittterv2r.

Demo

Getting tweets of specified users via get_user_tweets(). This function captures tweets for a particular user or set of users and collects tweets between specified date ranges, avoiding rate limits by sleeping between calls. A call may look like:


bearer_token <- "" # Insert bearer token

users <- c("TwitterDev", "jack")
get_user_tweets(users, "2020-01-01T00:00:00Z", "2020-01-05T00:00:00Z", bearer_token, data_path = "data/")

Getting tweets for a specified list of hashtags via get_hashtag_tweets(). This function captures tweets containing a particular hashtag or set of hashtags between specified date ranges, avoiding rate limits by sleeping between calls. See the Twitter developer documentation for information on building queries for the search endpoints.

A call may look like:


bearer_token <- "" # Insert bearer token

get_hashtag_tweets("#BLM OR #BlackLivesMatter", "2020-01-01T00:00:00Z", "2020-01-05T00:00:00Z", bearer_token, data_path = "data/")

The core function was originally adapted from a Gist by https://github.com/schochastics.

Files are stored as JSON files in the folders "data/" and "includes/": "data/" contains the main tweet parameters, while "includes/" contains additional user-level information.
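The stored JSON can be read back into R with a standard parser. Below is a minimal sketch (not the package's own code) using jsonlite; the actual filenames under "data/" depend on the API responses, so a toy file is simulated here.

```r
library(jsonlite)

# Simulate one stored tweet file for illustration
raw <- '[{"author_id": "123", "text": "hello world"}]'
tf <- tempfile(fileext = ".json")
writeLines(raw, tf)

# fromJSON() turns an array of JSON objects into a data.frame
tweets <- fromJSON(tf)
```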

If a filename is supplied, the functions will save the results as an RDS file; otherwise, they will return the results as a dataframe.
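The RDS round-trip relies on base R's saveRDS()/readRDS(). The sketch below uses a toy data frame as a stand-in for the function output, just to show how saved results are reloaded in a later session.

```r
# Toy stand-in for the data frame the functions return
tweets_df <- data.frame(author_id = c("1", "2"),
                        text = c("first tweet", "second tweet"),
                        stringsAsFactors = FALSE)

path <- file.path(tempdir(), "tweets.rds")
saveRDS(tweets_df, path)      # what the functions do when given a filename
restored <- readRDS(path)     # reload in a later session
identical(restored, tweets_df)  # TRUE
```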

For more information on the parameters and fields included in queries to new v2 Endpoint see: https://developer.twitter.com/en/docs/twitter-api/tweets/search/api-reference/get-tweets-search-all.

Note on User Information

The API call returns both the tweet data and the user information separately, but currently only the former is parsed. It is possible to obtain additional user information, such as the user handle and display name, which can then be merged with the tweet dataset using the author_id field.

bearer_token <- "" # Insert bearer token

users <- c("TwitterDev", "jack")
tweets_df <- get_user_tweets(users, "2020-01-01T00:00:00Z", "2020-01-05T00:00:00Z", bearer_token)

get_user_profile(unique(tweets_df$author_id), bearer_token)
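The merge on author_id can be done with base R's merge(). The sketch below uses toy data frames, assuming get_user_profile() returns a data frame keyed by author_id; the column names here are illustrative, not the package's guaranteed output.

```r
# Toy tweet data keyed by author_id
tweets_df <- data.frame(author_id = c("1", "2", "1"),
                        text = c("a", "b", "c"),
                        stringsAsFactors = FALSE)

# Toy user profiles, one row per author
users_df <- data.frame(author_id = c("1", "2"),
                       username = c("TwitterDev", "jack"),
                       stringsAsFactors = FALSE)

# Each tweet row gains its author's profile columns
merged <- merge(tweets_df, users_df, by = "author_id")
nrow(merged)  # 3
```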
