socius-org / RedditHarbor
Ethical, legal, and effortless extraction of Reddit data in your database
☆64Updated 5 months ago
Alternatives and similar repositories for RedditHarbor:
Users that are interested in RedditHarbor are comparing it to the libraries listed below
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in…☆349Updated this week
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆147Updated last year
- QualiGPT: An easy-to-use tool for qualitative research☆24Updated 5 months ago
- Example scripts for the pushshift dump files☆334Updated this week
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆59Updated last month
- Releases for the reddit-graph project☆19Updated 7 months ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆216Updated last year
- HDBSCAN Tuning for BERTopic Models☆44Updated last year
- Handy Jupyter Notebooks that I use for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆42Updated 5 years ago
- Code for measuring novelty in science using publication text☆24Updated last week
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆163Updated 8 months ago
- Cleans Reddit Text Data☆81Updated 4 years ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆15Updated 3 months ago
- A Python library for calculating a large variety of metrics from text☆329Updated 2 months ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 2 years ago
- Download subreddit comments☆93Updated 3 years ago
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).☆26Updated 8 months ago
- A set of jupyter notebooks demonstrating how to use the Media Cloud API.☆37Updated last year
- Command-line utility to help researchers collect video metadata from the YouTube API☆29Updated 6 months ago
- Tools for interactive visual exploration of semantic embeddings.☆32Updated 6 months ago
- Tools for conducting and parsing web search☆39Updated this week
- Powerful topic model visualization in Python☆113Updated 3 weeks ago
- TopicGPT allows integrating the benefits of LLMs into Topic Modelling☆82Updated 8 months ago
- A python command line tool to help you search your chatgpt conversation history.☆24Updated last year
- Interactive visual tool for the demonstration of topic evolution☆40Updated 4 years ago
- Scrape articles and comments from NYTimes☆19Updated last year
- ☆22Updated 4 years ago
- Pushshift Telegram Ingest☆85Updated 5 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆60Updated 10 months ago
- Package to extract connotation frames☆83Updated last year