Watchful1 / PushshiftDumps
Example scripts for the pushshift dump files
☆275Updated this week
Related projects: ⓘ
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆234Updated last week
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆212Updated last year
- Python Pushshift.io API Wrapper (for comment/submission search)☆359Updated last year
- Reddit archiver☆149Updated 7 months ago
- Pushshift API☆1,289Updated last year
- ☆116Updated 11 months ago
- Download subreddit comments☆90Updated 2 years ago
- Read compressed NDJSON .zst files easily☆33Updated 2 years ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆46Updated 2 months ago
- Pushshift Telegram Ingest☆83Updated 5 years ago
- Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.☆780Updated 11 months ago
- ☆14Updated 2 weeks ago
- A simple module to collect video, text, and metadata from Tiktok.☆317Updated last month
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo…☆14Updated 2 years ago
- Cleans Reddit Text Data☆79Updated 4 years ago
- A Python scraper for Goodreads books and reviews.☆265Updated 4 months ago
- NLP tool to extract dimensions of social exchange from textual conversations☆10Updated 6 months ago
- The subreddit archiver☆172Updated last year
- Comprehensive database of ratings for 11k news domains☆21Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆113Updated 2 weeks ago
- A webmining CLI tool & library for python.☆277Updated last week
- Tools for collecting social media data around focal events☆83Updated 2 years ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆304Updated last month
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆137Updated 8 months ago
- Fast, flexible extraction of moral information from textual input data.☆102Updated last year
- Turn Tweet IDs into Twitter JSON & CSV from your desktop!☆428Updated last year
- ☆26Updated 3 years ago
- Old Twint style, but zero fat.☆267Updated last year
- Streaming WARC/ARC library for fast web archive IO☆369Updated 2 weeks ago
- An analysis of YouTube's political influence through recommendations.☆153Updated last year