Filimoa / open-parseLinks
Improved file parsing for LLM’s
☆3,129Updated 11 months ago
Alternatives and similar repositories for open-parse
Users that are interested in open-parse are comparing it to the libraries listed below
Sorting:
- High-performance retrieval engine for unstructured data☆1,517Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,274Updated 7 months ago
- Developer APIs to Accelerate LLM Projects☆1,732Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,773Updated 8 months ago
- A Repo For Document AI☆3,033Updated this week
- Knowledge Agents and Management in the Cloud☆4,198Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,755Updated 5 months ago
- RAG that intelligently adapts to your use case, data, and queries☆3,575Updated last week
- An open-source visual programming environment for battle-testing prompts to LLMs.☆2,866Updated last week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o…☆2,772Updated last year
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,577Updated last year
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,904Updated last month
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,832Updated 2 months ago
- Deploy your agentic worfklows to production☆2,059Updated 2 months ago
- PyMuPDF4LLM☆1,113Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,565Updated 5 months ago
- The Open Source Memory Layer For Autonomous Agents☆2,503Updated last year
- Things you can do with the token embeddings of an LLM☆1,450Updated 2 weeks ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,031Updated last week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,882Updated last week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,760Updated this week
- Lightweight, performant, deep table extraction☆513Updated 3 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,532Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,477Updated last week
- Seamlessly integrate LLMs as Python functions☆2,378Updated 3 weeks ago
- LLM(😽)☆1,683Updated 9 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,159Updated 2 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,258Updated 5 months ago
- 🦜⛏️ Did you say you like data?☆1,173Updated 3 weeks ago
- ☆829Updated this week