chatdoc-com / OCRFluxLinks
OCRFlux is a lightweight yet powerful multimodal toolkit that significantly advances PDF-to-Markdown conversion, excelling in complex layout handling, complicated table parsing and cross-page content merging.
☆1,767Updated last week
Alternatives and similar repositories for OCRFlux
Users that are interested in OCRFlux are comparing it to the libraries listed below
Sorting:
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,511Updated 2 weeks ago
- A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具☆891Updated last month
- MultiAgentPPT 是一个集成了 A2A(Agent2Agent)+ MCP(Model Context Protocol)+ ADK(Agent Development Kit) 架构的智能化演示文稿生成系统,支持通过多智能体协作和流式并发机制☆842Updated this week
- E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with ded…☆1,093Updated 10 months ago
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic…☆627Updated last month
- PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides☆1,731Updated 2 weeks ago
- PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.☆3,052Updated this week
- 🤖 A visualization mcp contains 25+ visual charts using @antvis. Using for chart generation and data analysis.☆1,952Updated last week
- A MCP (Model Context Protocol) server for PowerPoint manipulation using python-pptx. This server provides tools for creating, editing, an…☆594Updated 3 weeks ago
- (Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, …☆1,962Updated 3 weeks ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆4,308Updated last week
- Python library for Agentic Document Extraction from LandingAI☆1,249Updated last week
- PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker☆526Updated last week
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆619Updated this week
- Yet Another Document Translator☆4,600Updated last week
- Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"☆633Updated 4 months ago
- 基于PaddleOCR重构,并且脱离PaddlePaddle深度学习训练框架的轻量级OCR,推理速度超快 —— A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle d…☆1,237Updated 3 weeks ago
- A Unicode-based text digital watermarking tool for embedding invisible copyright marks and metadata in text content.☆691Updated 2 weeks ago
- 一键将音视频转化为小红书/公众号/知识笔记/思维导图/视频字幕等各种风格的文档。☆1,951Updated last week
- OpenAI DeepResearch alternative, An AI-driven research system that performs comprehensive, iterative research on any topic using multiple…☆618Updated last month
- LiberSonora,寓意“自由的声音”,是一个 AI 赋能的、强大的、开源有声书工具集,包含智能字幕提取、AI标题生成、多语言翻译等功能,支持 GPU 加速、批量离线处理。LiberSonora, meaning "The Voice of Freedom," is a…☆426Updated 5 months ago
- AI Prompt Optimization Platform is a professional prompt engineering tool designed to help users optimize AI model prompts, enhancing the…☆440Updated 3 weeks ago
- AI Podcast Generator for bilingual episodes, Multi Languages, Alternative to NotebookLLM;真人对话AI播客生成器,多语言,多音色☆847Updated 2 weeks ago
- Lemon AI is the first Full-stack, Open-source, Agentic AI framework, offering a fully local alternative to platforms like Manus & Genspar…☆544Updated this week
- python package to parse pdfs with different parsers☆197Updated 7 months ago
- [ACL 2025 Demo] Repository for the demo and paper: ReasonGraph: Visualisation of Reasoning Paths☆493Updated last month
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆1,451Updated 3 months ago
- ☆478Updated 4 months ago
- A browser extension that helps users publish content to multiple social media platforms with one click.☆1,778Updated this week
- AI as Workspace - An elegant AI chat client. Full-featured, lightweight. Support multiple workspaces, plugin system, cross-platform, loca…☆1,195Updated 2 weeks ago