maxjo020418 / BAScraper
A simple asynchronous Python Reddit API wrapper for fetching posts, comments for data anlytics from Reddit. Utilizes PullPush and Arctic-Shift.
☆13Updated this week
Alternatives and similar repositories for BAScraper:
Users that are interested in BAScraper are comparing it to the libraries listed below
- Repository for deepdoctection tutorial notebooks☆42Updated 2 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆20Updated 3 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆66Updated 6 months ago
- Pipeline for converting PDFs to raw text with PaddleOCR☆21Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆47Updated 6 months ago
- ☆15Updated 11 months ago
- Tools for interactive visual exploration of semantic embeddings.☆30Updated 5 months ago
- Example LangGraph flow that does "competitor analysis" on the web.☆23Updated 8 months ago
- Create a music review RAG application with Neo4j☆19Updated 11 months ago
- We are exploring the potential impact of Generative AI on Nesta's Missions and work to uncover opportunities and risks that can inform Ne…☆27Updated 7 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆16Updated last year
- Crawl and convert any website into clean markdown☆44Updated 8 months ago
- Ethical, legal, and effortless extraction of Reddit data in your database☆64Updated 4 months ago
- Visual Studio Code extension to convert HTML to FastHTML FT☆17Updated this week
- A spaCy wrapper for GliNER☆107Updated 3 weeks ago
- The Python toolkit for converting Reddit threads into organized text data. Extract and process Reddit content with ease!☆89Updated 6 months ago
- A flexible and easy to use tool for Semantic Routing☆17Updated 5 months ago
- A Google Trends Analytics Package☆13Updated 8 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 11 months ago
- Pull reddit data from APIs and store it in local db☆13Updated 2 months ago
- Daily TV News Summary using GPT☆23Updated 2 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆134Updated last month
- Python crawler for getting books' metadata from the Google Books API using asyncio and aiohttp☆25Updated 4 years ago
- In this repository we put the code to split a document in a consistent way based on the concept of "idea"☆12Updated 3 months ago
- Scrape various open data directories to create an index of what's available out there☆36Updated last week
- This repository demonstrates a simple OpenAI Swarm-based system for multi-agent orchestration with Retrieval-Augmented Generation (RAG). …☆10Updated 4 months ago
- Code for DELSumm, an unsupervised summarization algorithm for legal case judgements.☆28Updated 2 years ago
- 🦦 weasel: A small and easy workflow system☆75Updated 7 months ago
- This is the python program which performs text summarization with pronoun replacement method. This method initially identifies pronouns i…☆11Updated 6 years ago
- Web crawler for Burplist, a search engine for craft beers in Singapore☆14Updated this week