katanaml / sparrowLinks
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆5,052Updated this week
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
Sorting:
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,285Updated 2 months ago
- Improved file parsing for LLM’s☆3,134Updated last year
- A system for agentic LLM-powered data processing and ETL☆3,101Updated last week
- Knowledge Agents and Management in the Cloud☆4,205Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,450Updated 4 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,234Updated 9 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,910Updated 2 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,362Updated 10 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,775Updated 6 months ago
- Developer APIs to Accelerate LLM Projects☆1,735Updated last year
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,715Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,456Updated 3 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,254Updated last week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,486Updated 3 weeks ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,763Updated this week
- Large Action Model framework to develop AI Web Agents☆6,209Updated 10 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,516Updated 6 months ago
- 🦜⛏️ Did you say you like data?☆1,176Updated last month
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,963Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,524Updated this week
- A Repo For Document AI☆3,075Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,940Updated 2 months ago
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆959Updated this week
- Open-source tool to visualise your RAG 🔮☆1,198Updated 10 months ago
- Deploy your agentic worfklows to production☆2,065Updated 2 months ago
- structured outputs for llms☆11,868Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,149Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,387Updated 6 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,779Updated 9 months ago
- An easy way to extract information from documents☆1,783Updated 2 years ago