katanaml / sparrowLinks
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆5,105Updated last week
Alternatives and similar repositories for sparrow
Users that are interested in sparrow are comparing it to the libraries listed below
Sorting:
- A Repo For Document AI☆3,136Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,525Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,483Updated 5 months ago
- Knowledge Agents and Management in the Cloud☆4,231Updated last week
- Improved file parsing for LLM’s☆3,152Updated last year
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,314Updated 2 months ago
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆1,149Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,935Updated 4 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,973Updated 2 months ago
- Developer APIs to Accelerate LLM Projects☆1,742Updated last year
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,275Updated 11 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,140Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆6,095Updated this week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,802Updated this week
- Detect and extract tables to markdown and csv☆754Updated last year
- Document to Markdown OCR library with Llama 3.2 vision☆2,424Updated last year
- An easy way to extract information from documents☆1,787Updated 2 years ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆13,915Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,551Updated 6 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,228Updated last week
- ☆2,112Updated 10 months ago
- docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.☆5,850Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,849Updated 8 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,525Updated 8 months ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,102Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,687Updated last month
- Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...☆1,526Updated 2 weeks ago
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,819Updated 2 months ago
- 🦜⛏️ Did you say you like data?☆1,185Updated this week
- Open-source tool to visualise your RAG 🔮☆1,216Updated last year