adithya-s-k / omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
☆5,974Updated 2 months ago
Alternatives and similar repositories for omniparse:
Users who are interested in omniparse are comparing it to the libraries listed below.
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆6,398Updated 2 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,430Updated this week
- This project is a template to help my pupils from SENAC build their own HTML5 project☆11Updated 6 months ago
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆4,258Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆15,474Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,719Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆28,471Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆6,576Updated this week
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.☆24,558Updated this week
- Parse files for optimal RAG☆3,526Updated last week
- PDF to Markdown with vision models☆8,298Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,320Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆5,961Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆5,509Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆21,705Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,518Updated 2 weeks ago
- tiny vision language model☆6,732Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆18,641Updated this week
- Improved file parsing for LLMs☆2,637Updated 2 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆19,314Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆2,188Updated last week
- The easiest way to use Agentic RAG in any enterprise☆3,972Updated 2 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆4,966Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆9,785Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆21,693Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,775Updated 3 months ago
- Automate browser-based workflows with LLMs and Computer Vision☆11,426Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆13,445Updated this week
- Neo4j graph construction from unstructured data using LLMs☆2,777Updated this week
- A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.☆6,810Updated this week