Watchful1 / PushshiftDumps
Example scripts for the pushshift dump files
☆359Updated last month
Alternatives and similar repositories for PushshiftDumps:
Users that are interested in PushshiftDumps are comparing it to the libraries listed below
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆391Updated 3 weeks ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆218Updated 2 years ago
- Pushshift API☆1,337Updated 2 years ago
- Python Pushshift.io API Wrapper (for comment/submission search)☆361Updated 2 years ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆68Updated 7 months ago
- Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.☆889Updated last year
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- ☆14Updated 8 months ago
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆137Updated 4 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆126Updated 4 months ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 3 years ago
- Download subreddit comments☆95Updated 3 years ago
- Comprehensive database of ratings for 11k news domains☆25Updated last year
- Article extraction benchmark: dataset and evaluation scripts☆315Updated last year
- ☆167Updated 2 years ago
- Fast, flexible extraction of moral information from textual input data.☆108Updated last year
- A Python wrapper around the topic modeling functions of MALLET.☆101Updated 6 months ago
- analyze text with empath☆329Updated 8 years ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆347Updated last month
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆151Updated last year
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆378Updated 7 months ago
- Tools for collecting social media data around focal events☆84Updated 3 years ago
- Flexible calculation of moral foundation scores from textual input data based on word embedding methods.☆45Updated 2 years ago
- The repository contains a collection of tweets IDs associated with the 2020 U.S. Presidential Elections through 6 months post-inauguratio…☆133Updated 3 years ago
- ☆52Updated last year
- Package to extract connotation frames☆85Updated last year
- A Python scraper for Goodreads books and reviews.☆292Updated 2 months ago