adithya-s-k / omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
☆6,405Updated 4 months ago
Alternatives and similar repositories for omniparse:
Users that are interested in omniparse are comparing it to the libraries listed below
- This project is a template for my help my pupils from SENAC build your own project Html5☆11Updated 8 months ago
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆6,064Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,312Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆23,734Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,120Updated 2 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,257Updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,252Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆28,808Updated this week
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆3,206Updated this week
- OCR & Document Extraction using vision models☆10,605Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆31,990Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,220Updated 2 months ago
- 🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, repo…☆7,134Updated this week
- A collection of MCP servers.☆13,804Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆7,166Updated this week
- An open-source RAG-based tool for chatting with your documents.☆21,739Updated last month
- Task-Aware Agent-driven Prompt Optimization Framework☆3,002Updated this week
- 🪄 Create rich visualizations with AI☆10,640Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆23,161Updated this week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,818Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆3,736Updated this week
- SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.☆5,959Updated 3 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,563Updated 3 weeks ago
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,004Updated 2 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,961Updated last month
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆44,716Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆20,951Updated this week