jtbg / reddit-10-year-data
Data from the last ten years of reddit
☆45Updated 9 years ago
Related projects
Alternatives and complementary repositories for reddit-10-year-data
- press history for /r/thebutton☆60Updated 9 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆162Updated 8 years ago
- aesthetically pleasing words☆121Updated 7 years ago
- Script that compares the average performance of a television show to its finale, identifying shows that surprise and disappoint☆38Updated 7 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆112Updated 9 years ago
- https://www.kaylinpavlik.com/text-mining-south-park/☆173Updated 8 years ago
- A collection of tools for mining government data☆139Updated 8 years ago
- ☆45Updated 8 years ago
- Simple Python scripts to download all Hacker News submissions and comments and store them in a PostgreSQL database.☆120Updated 7 years ago
- Scrape & analyze MTA arrival times☆152Updated 8 years ago
- An automated subreddit with posts created using markov chains☆468Updated 9 years ago
- Analysis of Viber logs with R☆18Updated 9 years ago
- Principal Component Analysis and Fashion☆231Updated 9 years ago
- Download Hillary Clinton's emails and query them with sqlite☆153Updated 4 years ago
- Monitor /r/thebutton.☆131Updated 6 years ago
- Scraped data from the 2016 U.S. Election (President, Senate, House, Governor) and primaries, ballot measures and exit polls☆117Updated 5 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.☆286Updated last year
- ☆89Updated 9 years ago
- Political Speech Generator☆348Updated 8 years ago
- TensorFlow for AWS☆115Updated 9 years ago
- All stories and comments posted on Hacker News up to May 29, 2014☆128Updated 6 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions☆311Updated 8 years ago
- A fun image processing project in JavaScript that is marginally related to my learning theory research.☆58Updated 9 years ago
- OKCupid profile datasets, code to scrape okcupid, and code to compute reading level of text☆67Updated 8 years ago
- Twitter bot generating invented words and definitions using RNN + genetic algorithm☆131Updated 8 years ago
- Analysis of The Simpsons☆214Updated 4 years ago
- Loan-level analysis of Fannie Mae and Freddie Mac data☆216Updated 4 years ago
- A wrapper around tweepy to produce pandas dataframes for analysis☆75Updated 8 years ago