Filimoa / open-parseLinks
Improved file parsing for LLM’s
☆3,023Updated 8 months ago
Alternatives and similar repositories for open-parse
Users that are interested in open-parse are comparing it to the libraries listed below
Sorting:
- High-performance retrieval engine for unstructured data☆1,459Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,255Updated 4 months ago
- Developer APIs to Accelerate LLM Projects☆1,695Updated 9 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,409Updated last month
- Knowledge Agents and Management in the Cloud☆4,069Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,311Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,587Updated 2 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,253Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,287Updated 9 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,331Updated last week
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,530Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,714Updated 5 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,505Updated 2 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆1,863Updated 3 months ago
- A Repo For Document AI☆2,899Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,616Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM☆1,445Updated 4 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,691Updated last week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,123Updated last month
- LLM(😽)☆1,682Updated 5 months ago
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆1,331Updated 10 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,754Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆4,926Updated 3 weeks ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,226Updated 2 months ago
- A system for agentic LLM-powered data processing and ETL☆2,371Updated last week
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆994Updated last week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,132Updated 11 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,163Updated 5 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,088Updated this week
- All-in-one platform for search, recommendations, RAG, and analytics offered via API☆2,388Updated this week