Blacksuan19 / scrapy-aiLinks
Fully automated AI based web scraping.
☆33Updated last week
Alternatives and similar repositories for scrapy-ai
Users that are interested in scrapy-ai are comparing it to the libraries listed below
Sorting:
- scraping and querying documents for LLMs☆24Updated 3 months ago
- Spider ported to Python☆101Updated last week
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆106Updated last year
- Extract structured data from any content using LLMs.☆109Updated last month
- Python SDK for Browserbase☆73Updated this week
- ☆21Updated last year
- Open Source LLMOps tool for AI teams☆129Updated 11 months ago
- Docx tracked change redlines for the Python ecosystem.☆99Updated last year
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.☆47Updated 5 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆158Updated last month
- A Prodigy plugin for PDF annotation☆36Updated 5 months ago
- Scrapfly Python SDK for headless browsers and proxy rotation☆50Updated 2 weeks ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆173Updated last week
- A GPT powered CLI tool that answers questions about your data☆98Updated 2 years ago
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆79Updated 4 years ago
- Spider templates for automatic crawlers.☆34Updated 2 weeks ago
- Docker Streamlit Template☆36Updated 4 months ago
- A Python client for the People Data Labs API☆35Updated last week
- A collection of Compound Retrieval Systems implemented with DSPy and Weaviate.☆92Updated 2 weeks ago
- Apify API client for Python☆88Updated last week
- An example to use MultiModal capabilities with Pydantic-AI to process and analyze images☆36Updated last year
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- AI + Legal APIs: A Tool-Based Retrieval Augmented Generation Workbench for Legal AI UX Research.☆121Updated last year
- ☆80Updated 2 weeks ago
- Example LangGraph flow that does "competitor analysis" on the web.☆38Updated last year
- ☆20Updated last week
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated 2 years ago
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆48Updated last year
- Open-source versioning, tracing, and annotation tooling.☆212Updated 2 months ago
- simplifies the process of creating and managing LLM workflows.☆113Updated last year