katanaml / sparrow
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆5,068Updated this week
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
- Knowledge Agents and Management in the Cloud☆4,219Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,243Updated 9 months ago
- A system for agentic LLM-powered data processing and ETL☆3,251Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,460Updated 3 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,952Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,783Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,918Updated 2 months ago
- Improved file parsing for LLMs☆3,141Updated last year
- The easiest way to use Agentic RAG in any enterprise☆4,372Updated 10 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,413Updated 10 months ago
- Developer APIs to Accelerate LLM Projects☆1,742Updated last year
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆6,007Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,299Updated 3 weeks ago
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆1,108Updated this week
- A Repo For Document AI☆3,105Updated this week
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,517Updated last week
- High-performance retrieval engine for unstructured data☆1,541Updated last month
- Deploy your agentic workflows to production☆2,063Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,152Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,470Updated 5 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,689Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,411Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,916Updated last week
- All-in-one platform for search, recommendations, RAG, and analytics offered via API☆2,573Updated 2 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,790Updated 9 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,794Updated 7 months ago
- Open-source tool to visualise your RAG 🔮☆1,199Updated 11 months ago
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,737Updated this week
- The open-source visual AI programming environment and TypeScript library☆4,398Updated 2 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,693Updated this week