mattpodolak / pmawLinks
A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.
☆220Updated 2 years ago
Alternatives and similar repositories for pmaw
Users that are interested in pmaw are comparing it to the libraries listed below
Sorting:
- Python Pushshift.io API Wrapper (for comment/submission search)☆361Updated 2 years ago
- Example scripts for the pushshift dump files☆387Updated last week
- Pushshift API☆1,363Updated 2 years ago
- Download subreddit comments☆96Updated 3 years ago
- A Python Package which helps to scrape all news details from any news websites☆214Updated 2 months ago
- Pushshift Telegram Ingest☆86Updated 5 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆74Updated 2 years ago
- Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.☆915Updated last year
- Cleans Reddit Text Data☆82Updated 5 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆154Updated last month
- This repository provides usage examples for the Python module Newspaper3k.☆147Updated last year
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆514Updated 2 weeks ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆356Updated 4 months ago
- GetOldTweets-Python is a project written in Python to mine old and backdated tweets, It bypasses some limitations/restrictions of the Twi…☆126Updated 2 years ago
- Read compressed NDJSON .zst files easily☆33Updated 3 years ago
- A python utility for downloading Common Crawl data☆242Updated 2 years ago
- Repository containing all files relevant to my basic and advanced tweet scraping articles.☆201Updated 2 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆52Updated 3 years ago
- Script for GoogleNews☆375Updated last year
- Fast and robust date extraction from web pages, with Python or on the command-line☆138Updated 3 weeks ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- A Python scraper for Goodreads books and reviews.☆295Updated 5 months ago
- A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆453Updated last year
- A Python Client for collect and parse public data from the Youtube Data API☆81Updated 2 years ago
- Repository for TweetEval☆382Updated 3 years ago
- 📊 Semantic search for headlines and story text☆360Updated last year
- Download YouTube video description and video comments without using the YouTube API.☆169Updated last year
- Measure the readability of a given text using surface characteristics☆79Updated 6 months ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆120Updated 5 years ago