Blacksuan19 / scrapy-ai
Fully automated AI based web scraping.
☆32Updated 9 months ago
Alternatives and similar repositories for scrapy-ai
Users who are interested in scrapy-ai are comparing it to the libraries listed below
- Spider ported to Python☆97Updated 9 months ago
- scraping and querying documents for LLMs☆24Updated last month
- Taking Normal Text as Input and Generating SQL commands using OpenAI's GPT-3☆15Updated 5 years ago
- This is a proof of concept of using an LLM to find and extract meaningful data without parsing the HTML too much.☆30Updated 2 years ago
- ☆62Updated 7 months ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆148Updated last week
- ☆20Updated 7 months ago
- A FastAPI extension for integrating common AI agent frameworks.☆45Updated 9 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆150Updated 3 weeks ago
- ☆22Updated last year
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆79Updated 4 years ago
- https://verdad.app☆83Updated last week
- Python SDK for Browserbase☆69Updated last week
- Spider templates for automatic crawlers.☆32Updated last month
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆48Updated last year
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆104Updated last year
- simplifies the process of creating and managing LLM workflows.☆112Updated last year
- Awesome and Best FastHTML Resources For Python Developers☆75Updated last year
- Docx tracked change redlines for the Python ecosystem.☆87Updated last year
- Library that helps use puppeteer in scrapy.☆52Updated 3 months ago
- DuckDB Community Extension to prompt LLMs from SQL☆51Updated last month
- Example LangGraph flow that does "competitor analysis" on the web.☆37Updated last year
- A pattern to let you try several vector databases and change as little code as possible☆38Updated 2 years ago
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.☆45Updated 3 months ago
- Import unstructured data (text and images) into structured tables☆159Updated last week
- Extract structured data from any content using LLMs.☆55Updated last week
- S3 vector database for LLM Agents and RAG.☆49Updated 2 years ago
- ☆104Updated 5 months ago
- A curated list of tools related to notebooklm as well as examples of great podcasts generated by notebooklm☆88Updated last year
- Visual Studio Code extension to convert HTML to FastHTML FT☆21Updated 8 months ago