Filimoa / open-parseLinks

Improved file parsing for LLM’s

☆3,023

Alternatives and similar repositories for open-parse

Users that are interested in open-parse are comparing it to the libraries listed below

Sorting:

D-Star-AI / dsRAG
High-performance retrieval engine for unstructured data
☆1,459Updated this week
nlmatics / nlm-ingestor
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
☆1,255Updated 4 months ago
nlmatics / llmsherpa
Developer APIs to Accelerate LLM Projects
☆1,695Updated 9 months ago
circlemind-ai / fast-graphrag
RAG that intelligently adapts to your use case, data, and queries
☆3,409Updated last month
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆4,069Updated last week
lumina-ai-inc / chunkr
Vision infrastructure to turn complex documents into RAG/LLM-ready data
☆2,311Updated last month
AnswerDotAI / RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,587Updated 2 months ago
qdrant / fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
☆2,253Updated last week
kingjulio8238 / Memary
The Open Source Memory Layer For Autonomous Agents
☆2,287Updated 9 months ago
michaelfeil / infinity
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
☆2,331Updated last week
Dataherald / dataherald
Interact with your SQL database, Natural Language to SQL using LLMs
☆3,530Updated last year
Dicklesworthstone / llm_aided_ocr
Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.
☆2,714Updated 5 months ago
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505Updated 2 months ago
qhjqhj00 / MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
☆1,863Updated 3 months ago
deepdoctection / deepdoctection
A Repo For Document AI
☆2,899Updated last week
pingcap / autoflow
pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…
☆2,616Updated 2 weeks ago
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,445Updated 4 months ago
aurelio-labs / semantic-router
Superfast AI decision making and intelligent processing of multi-modal data.
☆2,691Updated last week
SciPhi-AI / R2R
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
☆7,123Updated last month
eyurtsev / kor
LLM(😽)
☆1,682Updated 5 months ago
parthsarthi03 / raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
☆1,331Updated 10 months ago
AgentOps-AI / tokencost
Easy token price estimates for 400+ LLMs. TokenOps.
☆1,754Updated this week
katanaml / sparrow
Structured data extraction and instruction calling with ML, LLM and Vision LLM
☆4,926Updated 3 weeks ago
X-PLUG / mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
☆2,226Updated 2 months ago
ucbepic / docetl
A system for agentic LLM-powered data processing and ETL
☆2,371Updated last week
pymupdf / RAG
RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF
☆994Updated last week
lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,132Updated 11 months ago
truefoundry / cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
☆4,163Updated 5 months ago
illuin-tech / colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,088Updated this week
devflowinc / trieve
All-in-one platform for search, recommendations, RAG, and analytics offered via API
☆2,388Updated this week