katanaml / sparrowLinks
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆5,084Updated 2 weeks ago
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
Sorting:
- A system for agentic LLM-powered data processing and ETL☆3,355Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,473Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,227Updated 3 weeks ago
- Improved file parsing for LLM’s☆3,146Updated last year
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,309Updated last month
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆6,033Updated this week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,791Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,496Updated 5 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,927Updated 3 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,256Updated 10 months ago
- Deploy your agentic worfklows to production☆2,067Updated 3 weeks ago
- Developer APIs to Accelerate LLM Projects☆1,743Updated last year
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,132Updated this week
- A Repo For Document AI☆3,111Updated this week
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆1,121Updated this week
- The easiest way to use Agentic RAG in any enterprise☆4,384Updated 11 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,963Updated last month
- A language model programming library.☆5,869Updated 7 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,589Updated 3 weeks ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,276Updated 9 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,803Updated 2 weeks ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,811Updated 10 months ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,573Updated last week
- The modern replacement for Jupyter Notebooks☆2,180Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,522Updated 7 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,544Updated this week
- Lightweight library for scraping web-sites with LLMs☆1,252Updated 3 weeks ago
- Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images☆2,716Updated this week
- An easy way to extract information from documents☆1,784Updated 2 years ago
- structured outputs for llms☆12,065Updated last week