Filimoa / open-parse
Improved file parsing for LLMβs
β2,866Updated 4 months ago
Alternatives and similar repositories for open-parse:
Users that are interested in open-parse are comparing it to the libraries listed below
- High-performance retrieval engine for unstructured dataβ1,272Updated this week
- π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking libraryβ2,818Updated this week
- RAG that intelligently adapts to your use case, data, and queriesβ3,042Updated 3 weeks ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,563Updated 3 weeks ago
- Knowledge Agents and Management in the Cloudβ3,791Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,198Updated 5 months ago
- Developer APIs to Accelerate LLM Projectsβ1,615Updated 5 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ1,882Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidβ¦β2,465Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,328Updated last month
- A Repo For Document AIβ2,756Updated this week
- An open-source visual programming environment for battle-testing prompts to LLMs.β2,537Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.β10,581Updated this week
- A simple, easy-to-hack GraphRAG implementationβ2,681Updated this week
- A system for agentic LLM-powered data processing and ETLβ1,718Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.β2,473Updated last week
- Build and query dynamic, temporally-aware Knowledge Graphsβ2,478Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,338Updated last month
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundryβ3,961Updated last month
- β692Updated last month
- Easy token price estimates for 400+ LLMs. TokenOps.β1,608Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpaliβ1,924Updated last week
- A blazing fast inference solution for text embeddings modelsβ3,321Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready dataβ2,017Updated this week
- Deploy your agentic worfklows to productionβ1,981Updated 2 weeks ago
- Neo4j graph construction from unstructured data using LLMsβ3,180Updated this week
- Empowering RAG with a memory-based data interface for all-purpose applications!β1,690Updated 3 weeks ago
- Things you can do with the token embeddings of an LLMβ1,433Updated last month
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the oβ¦β2,530Updated 8 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β5,735Updated this week