mattpodolak / pmaw
A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.
☆218Updated 2 years ago
Alternatives and similar repositories for pmaw:
Users that are interested in pmaw are comparing it to the libraries listed below
- Python Pushshift.io API Wrapper (for comment/submission search)☆361Updated 2 years ago
- Example scripts for the pushshift dump files☆361Updated last month
- Pushshift API☆1,337Updated 2 years ago
- Download subreddit comments☆95Updated 3 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- ☆124Updated last year
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆391Updated last month
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.☆891Updated last year
- Turn Tweet IDs into Twitter JSON & CSV from your desktop!☆435Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- ☆53Updated 2 years ago
- GetOldTweets-Python is a project written in Python to mine old and backdated tweets, It bypasses some limitations/restrictions of the Twi…☆126Updated 2 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆126Updated 4 months ago
- An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000…☆57Updated 3 years ago
- A Python Package which helps to scrape all news details from any news websites☆201Updated this week
- Repository containing all files relevant to my basic and advanced tweet scraping articles.☆200Updated last year
- Interpretable data visualizations for understanding how texts differ at the word level☆275Updated 2 months ago
- This repository provides usage examples for the Python module Newspaper3k.☆147Updated last year
- Download YouTube video description and video comments without using the YouTube API.☆165Updated 11 months ago
- Pretrained BERT model for analysing COVID-19 Twitter data☆185Updated 2 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆151Updated last year
- A web scraper for TikTok using Playwright☆87Updated 2 weeks ago
- A deep learning system for demographic inference (gender, age, and individual/person) that was trained on massive Twitter dataset using p…☆149Updated 2 years ago
- Releases for the reddit-graph project☆19Updated 9 months ago
- Facebook Post Scraper 🕵️🖱️☆332Updated 2 years ago
- Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19☆14Updated 4 years ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆68Updated 7 months ago