The resources here are either provided by the infolab or are highly recommended. We hope you find them useful.
Download 1000 randomly selected Twitter users and their tweets: sample.tgz. Each user’s tweets reside within a file with the following attributes:
For example, a tweet sent by Joe would be in joe.tweets and be formatted as follows:
2008-11-14T16:01:38+00:00 i had catfood for dinner
NOTE: The contents of the tweets have not been censored in any way. All content is publicly available, so no anonymization has been performed.
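As a rough illustration of the line format shown above, the following Python sketch splits one line into its timestamp and text. It assumes each tweet occupies a single line and that the timestamp is separated from the text by one space; the names `Tweet` and `parse_tweet_line` are hypothetical and are not part of the dataset or of any infolab tool.

```python
from datetime import datetime
from typing import NamedTuple


class Tweet(NamedTuple):
    """Hypothetical container for one parsed line of a .tweets file."""
    timestamp: datetime
    text: str


def parse_tweet_line(line: str) -> Tweet:
    """Split a line into its ISO 8601 timestamp and the tweet text.

    Assumption: the first space on the line ends the timestamp.
    """
    stamp, _, text = line.rstrip("\n").partition(" ")
    return Tweet(datetime.fromisoformat(stamp), text)


# The example line from the description above.
tweet = parse_tweet_line("2008-11-14T16:01:38+00:00 i had catfood for dinner")
print(tweet.timestamp.year, tweet.text)
```

If the real files contain multi-line tweets or extra fields, the split logic would need adjusting.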
This is a collection of software that the infolab members have found particularly useful at one time or another. We are incredibly grateful to the authors of these assets.
Named entity taggers
Sentiment Polarity Lexicons
Features for supervised learning
Get Twitter Data by
Benchmarks and other resources