mattpodolak / pmaw
A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.
☆215Updated last year
Alternatives and similar repositories for pmaw:
Users that are interested in pmaw are comparing it to the libraries listed below
- Python Pushshift.io API Wrapper (for comment/submission search)☆361Updated last year
- Example scripts for the pushshift dump files☆308Updated last week
- Pushshift API☆1,311Updated last year
- Download subreddit comments☆93Updated 2 years ago
- ☆124Updated last year
- Pushshift Telegram Ingest☆84Updated 5 years ago
- Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.☆840Updated last year
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆305Updated last week
- Cleans Reddit Text Data☆81Updated 4 years ago
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆68Updated 2 years ago
- GetOldTweets-Python is a project written in Python to mine old and backdated tweets, It bypasses some limitations/restrictions of the Twi…☆128Updated last year
- Repository containing all files relevant to my basic and advanced tweet scraping articles.☆199Updated last year
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated last year
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆142Updated last year
- ☆53Updated last year
- A Python wrapper around the topic modeling functions of MALLET.☆100Updated 2 months ago
- Repository for TweetEval☆363Updated 2 years ago
- Catalog of abusive language data (PLoS 2020)☆308Updated 7 months ago
- A Python Client for collect and parse public data from the Youtube Data API☆80Updated last year
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆165Updated 4 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆52Updated 3 years ago
- A Python API for Botometer by OSoMe☆384Updated 6 months ago
- A multilingual lexicon of words to hurt.☆82Updated 2 months ago
- ☆14Updated 4 months ago
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆117Updated 5 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆274Updated 6 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆322Updated 4 months ago
- Concept Modeling: Topic Modeling on Images and Text☆201Updated 2 months ago