socius-org / RedditHarborLinks
Ethical, legal, and effortless extraction of Reddit data in your database
☆69Updated 7 months ago
Alternatives and similar repositories for RedditHarbor
Users that are interested in RedditHarbor are comparing it to the libraries listed below
Sorting:
- Example scripts for the pushshift dump files☆368Updated last month
- ☆14Updated 9 months ago
- Newsfeed based on GDELT Project☆26Updated last year
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆37Updated 3 weeks ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆153Updated last year
- ☆23Updated 3 years ago
- A Python client for the GDELT 2.0 Doc API☆135Updated last month
- HDBSCAN Tuning for BERTopic Models☆47Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆415Updated 3 weeks ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆219Updated 2 years ago
- An open interface to GDELT APIs☆49Updated last year
- Download subreddit comments☆94Updated 3 years ago
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo…☆14Updated 3 years ago
- ☆54Updated 2 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆63Updated 3 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆91Updated 11 months ago
- Command-line utility to help researchers collect video metadata from Youtube API☆29Updated 9 months ago
- TikTok Content Scraper -- No API-Key needed, minimal dependencies, citable | Download videos (MP4), slides (JPEG) and metadata of author,…☆26Updated last month
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 11 months ago
- This repository contains all resources from the paper "Introducing MBIB - the first Media Bias Identification Benchmark Task and Dataset …☆31Updated last year
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Package to extract connotation frames☆85Updated last year
- Tools for conducting and parsing web search☆42Updated last week
- Extract ESCO skills and ISCO occupations from texts such as job descriptions or CVs☆13Updated last week
- Code for measuring novelty in science using publication text☆27Updated 3 months ago
- Comprehensive database of ratings for 11k news domains☆25Updated last year
- Linguistic Inquiry and Word Count (LIWC) analyzer☆213Updated 3 years ago
- The all-in-one Python package for seamless newspaper article indexing, scraping, and processing – supports public and premium content!☆22Updated 2 years ago