Filimoa / open-parse
Improved file parsing for LLMs
☆2,997 Updated 7 months ago
Alternatives and similar repositories for open-parse
Users that are interested in open-parse are comparing it to the libraries listed below
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,245 Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,496 Updated last month
- Developer APIs to Accelerate LLM Projects ☆1,679 Updated 8 months ago
- Knowledge Agents and Management in the Cloud ☆4,014 Updated this week
- High-performance retrieval engine for unstructured data ☆1,408 Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,674 Updated 3 months ago
- Data processing and instruction calling with ML, LLM and Vision LLM ☆4,574 Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,449 Updated 3 weeks ago
- Deploy your agentic workflows to production ☆2,022 Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆2,133 Updated 3 weeks ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,234 Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,206 Updated last week
- A Repo For Document AI ☆2,851 Updated this week
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,201 Updated 2 weeks ago
- RAG that intelligently adapts to your use case, data, and queries ☆3,315 Updated 2 months ago
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,634 Updated last month
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆11,525 Updated this week
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o… ☆2,636 Updated 11 months ago
- Structured Text Generation ☆11,750 Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆6,968 Updated last week
- Supercharge Your LLM Application Evaluations 🚀 ☆9,535 Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,101 Updated 3 months ago
- structured outputs for llms ☆10,747 Updated this week
- LLM(😽) ☆1,673 Updated 4 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,276 Updated last week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ☆1,953 Updated last week
- SoTA LLM for converting natural language questions to SQL queries ☆3,791 Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024 ☆2,077 Updated this week
- The Open Source Memory Layer For Autonomous Agents ☆2,247 Updated 7 months ago
- Open-source tool to visualise your RAG 🔮 ☆1,136 Updated 5 months ago