Filimoa / open-parseLinks
Improved file parsing for LLM’s
☆3,013Updated 7 months ago
Alternatives and similar repositories for open-parse
Users that are interested in open-parse are comparing it to the libraries listed below
Sorting:
- High-performance retrieval engine for unstructured data☆1,439Updated 3 weeks ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,688Updated 4 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.☆1,254Updated 3 months ago
- Developer APIs to Accelerate LLM Projects☆1,683Updated 8 months ago
- Knowledge Agents and Management in the Cloud☆4,046Updated this week
- RAG that intelligently adapts to your use case, data, and queries☆3,353Updated 3 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,560Updated last month
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,298Updated last week
- The Open Source Memory Layer For Autonomous Agents☆2,275Updated 8 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,728Updated this week
- Things you can do with the token embeddings of an LLM☆1,442Updated 3 months ago
- A Repo For Document AI☆2,874Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,671Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,137Updated 4 months ago
- Deploy your agentic worfklows to production☆2,035Updated last week
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDF☆976Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,192Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,492Updated last month
- Empowering RAG with a memory-based data interface for all-purpose applications!☆1,847Updated 2 months ago
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆11,873Updated this week
- Open-source tool to visualise your RAG 🔮☆1,144Updated 6 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆4,607Updated last week
- Seamlessly integrate LLMs as Python functions☆2,332Updated 2 weeks ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,608Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,376Updated 5 months ago
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,525Updated 11 months ago
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,717Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,269Updated last week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,133Updated last week
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cro…☆823Updated last week