Blacksuan19 / scrapy-aiLinks
Fully automated AI based web scraping.
☆33Updated 10 months ago
Alternatives and similar repositories for scrapy-ai
Users that are interested in scrapy-ai are comparing it to the libraries listed below
Sorting:
- scraping and querying documents for LLMs☆24Updated 2 months ago
- Spider ported to Python☆100Updated 11 months ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆106Updated last year
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆30Updated 2 years ago
- Docx tracked change redlines for the Python ecosystem.☆95Updated last year
- Spider templates for automatic crawlers.☆33Updated 3 weeks ago
- Taking Normal Text as Input and Generating SQL commands using the OpenAI's GPT-3☆15Updated 5 years ago
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆48Updated last year
- Library that helps use puppeteer in scrapy.☆52Updated 4 months ago
- ☆20Updated 9 months ago
- Python SDK for Browserbase☆70Updated last week
- A FastAPI extension for integrating common AI agent frameworks.☆46Updated 11 months ago
- Visual Studio Code extension to convert HTML to FastHTML FT☆22Updated 10 months ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆167Updated this week
- simplifies the process of creating and managing LLM workflows.☆114Updated last year
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- Open Source LLMOps tool for AI teams☆129Updated 10 months ago
- agenty☆43Updated 10 months ago
- https://verdad.app☆83Updated this week
- Search for words, documents, images, videos, news and maps using the Brave search engine. Downloading files and images to a local hard dr…☆78Updated 5 months ago
- Knowledge chatbot using Agentic Retrieval Augmented Generation (RAG) techniques. Full-stack proof of concept built on langchain, llama-in…☆44Updated 2 years ago
- Docker Streamlit Template☆36Updated 3 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters