umbrae / reddit-top-2.5-million
This is a dataset of the all-time top 1,000 posts, from the top 2,500 subreddits by subscribers, pulled from reddit between August 15–20, 2013.
☆616 Updated 4 years ago
Related projects
Alternatives and complementary repositories for reddit-top-2.5-million
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies. ☆286 Updated last year
- A collection of reddit bots and utilities ☆482 Updated 4 months ago
- Extract user info from their reddit comments and activity. ☆57 Updated 9 months ago
- Reddit crawlers that aim to give some insight ☆381 Updated 7 years ago
- SnoopSnoo: reddit user and subreddit analytics ☆87 Updated 7 years ago
- A Markov chain based text generation library and MegaHAL style chatbot ☆242 Updated 3 years ago
- Looks up posts from reddit and automatically posts them on Twitter. ☆138 Updated 4 years ago
- Summarizes news articles ☆1,169 Updated 3 years ago
- An automated subreddit with posts created using Markov chains ☆468 Updated 9 years ago
- Automatic Web Article Summarizer ☆414 Updated 3 years ago
- Here's what you sound like... ☆132 Updated last year
- Saves all .json data from posts and comments in a subreddit ☆78 Updated 7 years ago
- A Python script for summarizing articles using nltk ☆542 Updated 8 years ago
- Simple example scripts for Twitter data collection with Tweepy in Python ☆169 Updated 4 years ago
- Uses frequency analysis to summarize text. ☆185 Updated last year
- Data from the last ten years of reddit ☆45 Updated 9 years ago
- Collection of my Reddit Bots, some with tutorials describing them ☆32 Updated 11 years ago
- A full list of open source bots. ☆140 Updated 6 years ago
- An application for parsing chat history from a Facebook data archive. ☆312 Updated 6 years ago
- Polls for retweet contests and retweets them. Inspired by http://www.hscott.net/twitter-contest-winning-as-a-service/ ☆236 Updated 5 years ago
- Pulls a listing of Reddit posts and then pulls the metadata (id, title, date/time, permalink, domain, url, author, score, upvote ratio, n… ☆12 Updated 6 years ago
- A simple AI capable of basic reading comprehension ☆370 Updated 8 years ago
- Markov chain text generator, as used for KingJamesProgramming ☆459 Updated 2 months ago
- Unofficial Python API for Hacker News. RESTful API at https://github.com/karan/HNify ☆391 Updated 5 years ago
- An auto robot to like my GF's posts on Instagram ☆401 Updated 5 years ago
- A curated list of beginner resources in Natural Language Processing ☆384 Updated 7 years ago
- Records the activity (comments and karma) on the hot page of a Reddit sub and prepares an animated data visualisation. ☆93 Updated 7 years ago