chatdoc-com / OCRFluxLinks
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page content merging.
☆2,189Updated 3 weeks ago
Alternatives and similar repositories for OCRFlux
Users that are interested in OCRFlux are comparing it to the libraries listed below
Sorting:
- MultiAgentPPT 是一个集成了 A2A(Agent2Agent)+ MCP(Model Context Protocol)+ ADK(Agent Development Kit) 架构的智能化演示文稿生成系统,支持通过多智能体协作和流式并发机制☆1,273Updated this week
- A lightweight LMM-based Document Parsing Model☆5,614Updated this week
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,665Updated this week
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,224Updated 11 months ago
- A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具☆1,563Updated last month
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.☆1,938Updated last week
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆670Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆3,671Updated last week
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]☆1,915Updated 2 weeks ago
- LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studio…☆2,737Updated last week
- A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆908Updated 3 weeks ago
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆3,190Updated last month
- Snip Anything Solve Everything☆1,160Updated last week
- iFlow cli is a comprehensive command-line intelligence that embeds in your terminal, analyzes your repositories, does coding tasks, inter…☆752Updated last week
- ☆489Updated 5 months ago
- ☆542Updated last year
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆752Updated last week
- (Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …☆2,031Updated last week
- A Unicode-based text digital watermarking tool for embedding invisible copyright marks and metadata in text content.☆734Updated last month
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆494Updated 3 months ago
- This is a 12306 ticket search server based on the Model Context Protocol (MCP).☆519Updated last week
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆5,608Updated 2 weeks ago
- RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.☆2,577Updated 4 months ago
- 🤖 A visualization mcp contains 25+ visual charts using @antvis. Using for chart generation and data analysis.☆2,609Updated last week
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆531Updated 2 months ago
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆638Updated 6 months ago
- 超高性能、安全的一站式开源资源获取加速引擎。其性能远超传统加速器,为您提供跨多个平台的统一高效的加速体验,涵盖代码储存库、包管理、AI 推理 API、容器镜像、模型及数据集等 | Ultra-high performance, secure, all-in-one open…☆2,080Updated this week
- Next-Gen AI Translation Tool Powered by LLM. Support Office documents, PDF, TXT, and more format with just one click.☆197Updated last week
- AI as Workspace - An elegant AI chat client. Full-featured, lightweight. Support multiple workspaces, plugin system, cross-platform, loca…☆1,271Updated 2 weeks ago
- AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM;真人对话AI播客生成器,多语言,多音色☆1,007Updated last month