socius-org / RedditHarborLinks
Ethical, legal, and effortless extraction of Reddit data in your database
☆75Updated 10 months ago
Alternatives and similar repositories for RedditHarbor
Users that are interested in RedditHarbor are comparing it to the libraries listed below
Sorting:
- Example scripts for the pushshift dump files☆386Updated 2 weeks ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆154Updated 3 weeks ago
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆498Updated last month
- ☆14Updated 3 weeks ago
- A Python client for the GDELT 2.0 Doc API☆146Updated 3 months ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆220Updated 2 years ago
- ☆109Updated last year
- an extensible tool to generate hyperlinks from legal citations☆34Updated 10 months ago
- A Python Package which helps to scrape all news details from any news websites☆213Updated 2 months ago
- Newsfeed based on GDELT Project☆29Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆35Updated 11 months ago
- API client for Truth Social☆244Updated 10 months ago
- Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆42Updated 6 years ago
- HDBSCAN Tuning for BERTopic Models☆48Updated 2 years ago
- An affect generator based on TextBlob and the NRC affect lexicon. Note that lexicon license is for research purposes only.☆73Updated 2 years ago
- Download subreddit comments☆96Updated 3 years ago
- This repository provides usage examples for the Python module Newspaper3k.☆147Updated last year
- A News Article Collection Library☆22Updated 2 years ago
- A python command line tool to help you search your chatgpt conversation history.☆27Updated last year
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆142Updated 7 months ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆355Updated 4 months ago
- Influencer dataset collected from Instagram☆113Updated 2 years ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆183Updated 2 months ago
- A list of over 5000 US news domains and their social media accounts☆44Updated 2 years ago
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Powerful topic model visualization in Python☆130Updated 4 months ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆18Updated 8 months ago
- A simple script for using Google's Vision API that will possibly develop into an actual tool.☆13Updated 7 years ago
- ☆126Updated 2 months ago
- A Tool for Navigating LLMs and Prompts for Computational Social Science and Digital Humanities Research☆25Updated last year