katanaml / sparrow
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆4,961Updated last week
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
- Knowledge Agents and Management in the Cloud☆4,123Updated this week
- Improved file parsing for LLMs☆3,048Updated 9 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,392Updated last week
- A system for agentic LLM-powered data processing and ETL☆2,793Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,131Updated 6 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,196Updated this week
- Developer APIs to Accelerate LLM Projects☆1,712Updated 10 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,837Updated last month
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,282Updated last month
- A Repo For Document AI☆2,941Updated last week
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,553Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,750Updated 6 months ago
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆924Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,817Updated 2 weeks ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,737Updated this week
- Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean…☆12,559Updated this week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,524Updated 2 weeks ago
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,464Updated 3 months ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,746Updated this week
- Lightweight library for scraping websites with LLMs☆1,210Updated last week
- The easiest way to use Agentic RAG in any enterprise☆4,315Updated 7 months ago
- High-performance retrieval engine for unstructured data☆1,486Updated last month
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,118Updated last week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,265Updated 5 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,382Updated 7 months ago
- ETL, Analytics, Versioning for Unstructured Data☆2,623Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,088Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,335Updated last week
- Detect and extract tables to markdown and csv☆750Updated 7 months ago
- The modern replacement for Jupyter Notebooks☆2,159Updated 9 months ago