Filimoa / open-parse
Improved file parsing for LLM’s
☆3,152 Updated last year
Alternatives and similar repositories for open-parse
Users who are interested in open-parse are comparing it to the libraries listed below.
- High-performance retrieval engine for unstructured data ☆1,553 Updated 2 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,276 Updated 10 months ago
- Developer APIs to Accelerate LLM Projects ☆1,742 Updated last year
- Knowledge Agents and Management in the Cloud ☆4,231 Updated last week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,850 Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries ☆3,687 Updated 3 months ago
- A Repo For Document AI ☆3,133 Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,935 Updated 4 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,653 Updated this week
- The Open Source Memory Layer For Autonomous Agents ☆2,562 Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,849 Updated 8 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆2,687 Updated 3 weeks ago
- LLM(😽) ☆1,697 Updated last year
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,316 Updated 2 months ago
- Superfast AI decision making and intelligent processing of multi-modal data. ☆3,250 Updated 2 months ago
- Things you can do with the token embeddings of an LLM ☆1,452 Updated 2 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,592 Updated last month
- Interact with your SQL database, Natural Language to SQL using LLMs ☆3,602 Updated last year
- Deploy your agentic workflows to production ☆2,073 Updated last week
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆2,203 Updated 4 months ago
- Lightweight, performant, deep table extraction ☆524 Updated 3 weeks ago
- The easiest way to use Agentic RAG in any enterprise ☆4,396 Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,105 Updated this week
- An open-source visual programming environment for battle-testing prompts to LLMs. ☆2,918 Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,478 Updated 5 months ago
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python… ☆1,458 Updated last year
- PyMuPDF4LLM ☆1,277 Updated last week
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,368 Updated 8 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ☆2,731 Updated 3 weeks ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆6,794 Updated last month