socius-org / RedditHarbor
Ethical, legal, and effortless extraction of Reddit data in your database
☆75 · Updated 9 months ago
Alternatives and similar repositories for RedditHarbor
Users who are interested in RedditHarbor are comparing it to the libraries listed below.
- Example scripts for the pushshift dump files ☆376 · Updated last week
- QualiGPT: An easy-to-use tool for qualitative research ☆32 · Updated 9 months ago
- Newsfeed based on GDELT Project ☆29 · Updated last year
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in… ☆474 · Updated last week
- A web scraper for TikTok using Playwright ☆96 · Updated 2 months ago
- ☆108 · Updated last year
- An open interface to GDELT APIs ☆51 · Updated last year
- A Python client for the GDELT 2.0 Doc API ☆144 · Updated 2 months ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se… ☆153 · Updated last year
- ☆14 · Updated this week
- The Python toolkit for converting Reddit threads into organized text data. Extract and process Reddit content with ease! ☆103 · Updated 11 months ago
- A set of Jupyter notebooks demonstrating how to use the Media Cloud API. ☆39 · Updated last month
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches. ☆221 · Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings. ☆35 · Updated 10 months ago
- A News Article Collection Library ☆22 · Updated 2 years ago
- A Tool for Navigating LLMs and Prompts for Computational Social Science and Digital Humanities Research ☆25 · Updated 11 months ago
- A Python Package which helps to scrape all news details from any news websites ☆211 · Updated last month
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆142 · Updated 6 months ago
- HDBSCAN Tuning for BERTopic Models ☆48 · Updated 2 years ago
- TikTok Content Scraper -- No API-Key needed, minimal dependencies, citable | Download videos (MP4), slides (JPEG) and metadata of author,… ☆27 · Updated last week
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆61 · Updated last year
- ☆24 · Updated 3 years ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts ☆46 · Updated 2 months ago
- A webmining CLI tool & library for Python. ☆331 · Updated last month
- A small command line tool and set of functions for studying coordination networks in Twitter and other social media data. ☆79 · Updated 2 years ago
- ☆18 · Updated 2 years ago
- A prompt-engineering technique for creating personalized custom instructions on ChatGPT ☆17 · Updated last year
- EmailGenius: AI-Driven Email Categorization ☆26 · Updated last year
- Building a Job Dataset ☆22 · Updated 3 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 · Updated 2 years ago