chatdoc-com / OCRFlux
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page content merging.
☆2,252Updated last month
Alternatives and similar repositories for OCRFlux
Users who are interested in OCRFlux are comparing it to the libraries listed below.
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,228Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆682Updated 2 weeks ago
- MultiAgentPPT is an intelligent presentation generation system that integrates the A2A (Agent2Agent) + MCP (Model Context Protocol) + ADK (Agent Development Kit) architecture, supporting multi-agent collaboration and streaming concurrency mechanisms☆1,308Updated last week
- A high-quality PDF to Markdown tool based on large language model visual recognition.☆1,582Updated 2 weeks ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆4,341Updated last week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,729Updated 3 weeks ago
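- PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves layout, supporting services such as Google/DeepL/Ollama/OpenAI, available as CLI/GUI/Docker☆1,011Updated this week
- An LLM-based presentation generation platform that automatically converts document content into professional PPT presentations. The platform supports multiple AI models and offers a wide selection of templates and styles, so users can create high-quality presentations.☆1,067Updated this week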
- Snip Anything Solve Everything☆1,236Updated 3 weeks ago
- LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studio…☆2,875Updated 2 weeks ago
- A lightweight OCR system rebuilt from PaddleOCR and decoupled from the PaddlePaddle deep learning training framework, with very fast inference☆1,367Updated 2 months ago
- (Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …☆2,040Updated last month
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆3,219Updated last month
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]☆1,981Updated 2 weeks ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆824Updated this week
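- MinerU is an open-source, high-quality PDF parsing tool based on deep learning that automatically extracts text, tables, images, formulas, and other content from PDF documents, and provides rich analysis, statistics, and search features. This project provides a simplified WebUI for it, letting users upload PDF files and view the extraction results in real time.☆251Updated 9 months ago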
- A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆982Updated last month
- UltraRAG 2.0: Less Code, Lower Barrier, Faster Deployment! MCP-based low-code RAG framework, enabling researchers to build complex pipeli…☆1,565Updated last week
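- A desktop application based on MinerU. MinerU is an open-source, high-quality PDF parsing tool based on deep learning that automatically extracts text, tables, images, formulas, and other content from PDF documents, and provides rich analysis, statistics, and search features. This project provides a simplified WebUI for it, letting users upload PDF files and view the extraction results in real time.☆106Updated 11 months ago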
- A Unicode-based text digital watermarking tool for embedding invisible copyright marks and metadata in text content.☆740Updated 2 months ago
- AI as Workspace - An elegant AI chat client. Full-featured, lightweight. Support multiple workspaces, plugin system, cross-platform, loca…☆1,325Updated last week
- AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM; an AI podcast generator with lifelike human dialogue, multiple languages, and multiple voices☆1,036Updated 2 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,616Updated 5 months ago
- 🤖 A visualization MCP with 25+ visual charts using @antvis, for chart generation and data analysis.☆2,786Updated this week
- ☆541Updated last year
- A collection of the best currently open-source table recognition models, with improved pre- and post-processing and models converted to ONNX☆830Updated last month
- ☆490Updated 6 months ago
- An AI video note generation tool: let AI take notes on your videos☆3,647Updated 2 months ago
- AingDesk is a simple and easy-to-use AI assistant that supports knowledge bases, model APIs, sharing, web search, and agents, and it is still growing rapidly.☆2,300Updated 2 months ago
- Lightweight MCP Server for Computer Use in Windows☆2,795Updated this week