umbrae / reddit-top-2.5-million
This is a dataset of the all-time top 1,000 posts, from the top 2,500 subreddits by subscribers, pulled from reddit between August 15–20, 2013.
☆623 Updated 5 years ago
Alternatives and similar repositories for reddit-top-2.5-million
Users who are interested in reddit-top-2.5-million are comparing it to the repositories listed below.
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies. ☆291 Updated 2 years ago
- A collection of reddit bots and utilities ☆495 Updated last year
- Automatic scraper that tracks changes in news articles over time. ☆501 Updated 4 years ago
- will try to make interesting reddit crawlers that give some insight ☆380 Updated 8 years ago
- Extract user info from their reddit comments and activity. ☆68 Updated last year
- Downloads images from sub-reddits of reddit.com. ☆310 Updated 2 years ago
- SnoopSnoo: reddit user and subreddits analytics ☆89 Updated 8 years ago
- Data from the last ten years of reddit ☆45 Updated 10 years ago
- Looks up posts from reddit and automatically posts them on Twitter. ☆143 Updated 5 years ago
- A full list of open source bots. ☆139 Updated 7 years ago
- Will poll for Retweet Contests and retweet them. Inspired by http://www.hscott.net/twitter-contest-winning-as-a-service/ ☆235 Updated 6 years ago
- Here's what you sound like... ☆133 Updated 2 years ago
- Searches for pronbots by looking at followers/following ☆92 Updated 7 years ago
- Download data from IMDB movies and parse into useful form ☆205 Updated 5 years ago
- Simple Python scripts to download all Hacker News submissions and comments and store them in a PostgreSQL database. ☆122 Updated 8 years ago
- Scrapes public information off of LinkedIn ☆111 Updated 9 years ago
- Tools to work with the big reddit JSON data dump. ☆256 Updated last year
- Download Hillary Clinton's emails and query them with sqlite ☆153 Updated 5 years ago
- .csv files containing script information including: season, episode, character, & line. ☆163 Updated last year
- An automated subreddit with posts created using markov chains ☆468 Updated 9 years ago
- ☆191 Updated 8 years ago
- Find your Facebook friends' Tinder profiles. Don't actually use this by the way that's weird. Not even in a good way. ☆702 Updated 7 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database ☆161 Updated 9 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily ☆111 Updated 9 years ago
- A list of scrapers from around the web. ☆684 Updated 6 months ago
- saves all .json data from posts and comments in a subreddit ☆78 Updated 8 years ago
- A Markov chain based text generation library and MegaHAL style chatbot ☆243 Updated 4 years ago
- Reddit bot that replies to comments with excerpt from linked wikipedia article or section. ☆96 Updated 10 years ago
- Python scripts to download Quora answers and convert them into a more portable form ☆126 Updated 4 years ago
- Grabbing all news. ☆62 Updated 5 years ago