Blacksuan19 / scrapy-ai
Fully automated AI based web scraping.
☆26 · Updated 7 months ago
Alternatives and similar repositories for scrapy-ai
Users who are interested in scrapy-ai are comparing it to the libraries listed below.
- Spider ported to Python ☆94 · Updated 8 months ago
- Real-time collaborative notebook that thinks with you ☆34 · Updated this week
- Docx tracked change redlines for the Python ecosystem. ☆83 · Updated last year
- A Prodigy plugin for PDF annotation ☆35 · Updated last month
- scraping and querying documents for LLMs ☆24 · Updated this week
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much. ☆30 · Updated 2 years ago
- Taking Normal Text as Input and Generating SQL commands using the OpenAI's GPT-3 ☆15 · Updated 5 years ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications ☆104 · Updated 11 months ago
- Open-source versioning, tracing, and annotation tooling. ☆194 · Updated this week
- Import unstructured data (text and images) into structured tables ☆154 · Updated 5 months ago
- https://verdad.app ☆83 · Updated this week
- S3 vector database for LLM Agents and RAG. ☆48 · Updated 2 years ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere ☆134 · Updated this week
- Example LangGraph flow that does "competitor analysis" on the web. ☆34 · Updated last year
- simplifies the process of creating and managing LLM workflows. ☆109 · Updated 11 months ago
- Semantic caching layer for your LLM applications. Reuse responses and reduce token usage. ☆85 · Updated 3 months ago
- DuckDB Community Extension to prompt LLMs from SQL ☆50 · Updated last week
- Code examples for Building Effective Agents ported and adapted to use Pydantic AI ☆89 · Updated 9 months ago
- Open Source LLMOps tool for AI teams ☆127 · Updated 7 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te… ☆301 · Updated 2 weeks ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆27 · Updated last year
- Convert and invoke OpenAPI specifications as LLM tool/function definitions ☆39 · Updated 5 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters ☆148 · Updated 9 months ago
- Python SDK for Browserbase ☆63 · Updated last week
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks. ☆44 · Updated last month
- F.L.A.T. - FlawLess AgenTs ☆36 · Updated 7 months ago
- agenty ☆43 · Updated 7 months ago
- Playing with Python Bluesky SDK ☆15 · Updated 10 months ago
- FalkorDB Python Client ☆34 · Updated last week
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p… ☆84 · Updated last year