Filimoa / open-parse
Improved file parsing for LLM's
★2,637 · Updated 2 months ago
Alternatives and similar repositories for open-parse:
Users that are interested in open-parse are comparing it to the libraries listed below
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ★2,249 · Updated this week
- High-performance retrieval engine for unstructured data ★1,110 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ★3,172 · Updated 4 months ago
- RAG that intelligently adapts to your use case, data, and queries ★2,747 · Updated this week
- Parse files for optimal RAG ★3,526 · Updated last week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ★1,154 · Updated 3 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ★3,518 · Updated 2 weeks ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ★2,298 · Updated 4 months ago
- A system for agentic LLM-powered data processing and ETL ★1,514 · Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ★1,714 · Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ★1,671 · Updated this week
- Things you can do with the token embeddings of an LLM ★1,411 · Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ★2,159 · Updated this week
- Developer APIs to Accelerate LLM Projects ★1,518 · Updated 2 months ago
- The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API. ★4,437 · Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ★3,442 · Updated 5 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications! ★1,502 · Updated last month
- The Open Source Memory Layer For Autonomous Agents ★1,955 · Updated 2 months ago
- Deploy your agentic workflows to production ★1,915 · Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data. ★2,294 · Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi… ★2,898 · Updated last week
- Vision model based document ingestion ★1,302 · Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ★4,258 · Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ★5,192 · Updated this week
- A Repo For Document AI ★2,659 · Updated this week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… ★2,022 · Updated this week
- Harness LLMs with Multi-Agent Programming ★2,917 · Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ★2,908 · Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! ★5,269 · Updated this week
- The code used to train and run inference with the ColPali architecture. ★1,386 · Updated this week