katanaml / sparrowLinks
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆4,999Updated last week
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
Sorting:
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,411Updated 3 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,246Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud☆4,144Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,889Updated last week
- A Repo For Document AI☆2,964Updated last week
- Improved file parsing for LLM’s☆3,089Updated 10 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,883Updated last week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,329Updated last month
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,745Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,182Updated 7 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,858Updated last month
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,351Updated 2 months ago
- The easiest way to use Agentic RAG in any enterprise☆4,333Updated 8 months ago
- Developer APIs to Accelerate LLM Projects☆1,724Updated 11 months ago
- Lightweight library for scraping web-sites with LLMs☆1,219Updated last month
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,497Updated 4 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,806Updated this week
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆931Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,400Updated 3 weeks ago
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,574Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,264Updated 4 months ago
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,794Updated 6 months ago
- Large Action Model framework to develop AI Web Agents☆6,176Updated 8 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆12,747Updated last week
- The modern replacement for Jupyter Notebooks☆2,166Updated 9 months ago
- High-performance retrieval engine for unstructured data☆1,502Updated last month
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,741Updated last year
- ☆2,031Updated 6 months ago
- Superduper: End-to-end framework for building custom AI applications and agents.☆5,213Updated 3 weeks ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,195Updated this week