chatdoc-com / OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page content merging.
☆2,416Updated 4 months ago
Alternatives and similar repositories for OCRFlux
Users who are interested in OCRFlux are comparing it to the libraries listed below.
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,824Updated 4 months ago
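- MultiAgentPPT is an intelligent presentation generation system that integrates the A2A (Agent2Agent) + MCP (Model Context Protocol) + ADK (Agent Development Kit) architecture, supporting multi-agent collaboration and streaming concurrency mechanisms☆1,449Updated 3 months ago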
- ☆810Updated 2 months ago
- A high-quality PDF to Markdown tool based on large language model visual recognition.☆1,624Updated 3 months ago
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,246Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆1,044Updated 2 weeks ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,299Updated last week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,936Updated this week
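- A free, open-source alternative to Wispr Flow | A next-generation Chinese desktop voice workflow that integrates local FunASR models and configurable large language models☆1,838Updated 2 months ago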
- ALLWEONE® Open source AI presentation generator Gamma Alternative. Create professional slides with customizable themes and AI-generated c…☆2,253Updated this week
- UltraRAG v2: A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines☆2,386Updated this week
- An Autonomous Agentic Framework for Reflective PowerPoint Generation☆2,966Updated this week
- LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studio…☆3,107Updated 2 months ago
- A smart, powerful, and beautiful excalidraw drawing tool. Draw professional charts with natural language☆2,553Updated this week
- Transcribe and summarize video content using AI. Open-source, multi-platform, and supports multiple languages.☆1,776Updated 2 months ago
- ☆1,374Updated this week
- A productivity tool for pasting Markdown and web AI conversations (ChatGPT/DeepSeek, etc.) cleanly into Word, WPS, and Excel with one click☆2,294Updated this week
- A quick vibe coded app for deepseek OCR☆1,536Updated last month
- A Unicode-based text digital watermarking tool for embedding invisible copyright marks and metadata in text content.☆759Updated 5 months ago
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆4,289Updated last week
- Snip Anything Solve Everything☆1,442Updated 3 weeks ago
- A collection of the best currently open-source table recognition models, with improved pre-processing and post-processing and models converted to ONNX☆906Updated 4 months ago
- Fogsight is an AI agent and animation engine powered by Large Language Models.☆1,963Updated last month
- Package Python projects into executables☆817Updated 3 months ago
- This is a 12306 ticket search server based on the Model Context Protocol (MCP).☆667Updated 2 months ago
- ☆509Updated 9 months ago
- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves layout, supports services such as Google/DeepL/Ollama/OpenAI, and provides CLI/GUI/Docker☆2,026Updated 2 weeks ago
- Out-of-the-box DeepSeek OCR document parsing Web Studio☆528Updated 2 months ago
- AI podcast generator for bilingual, human-like conversational episodes, with multiple languages and multiple voices; an alternative to NotebookLLM☆1,121Updated 6 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,897Updated 8 months ago