Based on RapidOCR, extract the PDF content
☆186Mar 6, 2026Updated 2 weeks ago
Alternatives and similar repositories for RapidOCRPDF
Users that are interested in RapidOCRPDF are comparing it to the libraries listed below
Sorting:
- 📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.☆6,153Updated this week
- 文档方向分类☆222Feb 3, 2026Updated last month
- rapidocr onnx cpp☆331Mar 25, 2025Updated 11 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆269Mar 6, 2026Updated 2 weeks ago
- 📝 针对文档类图像做内容提取 ,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆205Nov 1, 2024Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆69May 9, 2023Updated 2 years ago
- RedNote MCP - Xiaohongshu Content Search Tool☆23Jun 26, 2025Updated 8 months ago
- Go-EdgeGPT: Reverse engineered API of Microsoft's Bing Chat AI. 新必应聊天功能的逆向工程☆14Apr 10, 2023Updated 2 years ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆933Aug 3, 2025Updated 7 months ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 7 months ago
- ocr,pdf转docx,pdf to docx☆23Nov 4, 2022Updated 3 years ago
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆415Sep 4, 2025Updated 6 months ago
- Useful resources for creating apps and working with flow.☆11Oct 28, 2024Updated last year
- 修正文档扭曲/模糊/阴影 等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆98Dec 17, 2025Updated 3 months ago
- ☆12Jun 28, 2024Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32May 10, 2023Updated 2 years ago
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆307Sep 10, 2024Updated last year
- ☆22Nov 15, 2024Updated last year
- PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取☆211Oct 17, 2023Updated 2 years ago
- 中文停用词汇总,持续完善中,欢迎push共建☆16Jun 12, 2023Updated 2 years ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆382Nov 3, 2024Updated last year
- cnki,中国知网,论文,论文下载,摘要查询☆16Mar 31, 2020Updated 5 years ago
- 基于 nanobot 的个人 AI 助手,支持 MiniMax、Gemini 等多模型切换☆42Mar 2, 2026Updated 2 weeks ago
- The intelligent data query plugin under DataFocus that supports multi-round conversations provides plug-and-play ChatBI capabilities.☆14Apr 14, 2025Updated 11 months ago
- 大模型微调工具集合☆26Mar 15, 2024Updated 2 years ago
- Convert the model in PaddleOCR to ONNX format☆113Jul 15, 2025Updated 8 months ago
- 新品葱(WeCenter)数据库在线查询,使用 SQL 语句,支持导出 JSON 格式☆14May 6, 2019Updated 6 years ago
- High-Resolution Google Earth Image Cloud Detection Dataset☆14Dec 21, 2023Updated 2 years ago
- Pretty maps of areas of interest☆15Aug 10, 2023Updated 2 years ago
- TaskingAI Python Client☆21Jan 28, 2025Updated last year
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆58Apr 28, 2023Updated 2 years ago
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆76Jul 25, 2024Updated last year
- A simple wrapper for hiroi-sora/PaddleOCR-json.☆17Oct 20, 2023Updated 2 years ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- The `Lib-FXML` library simplifies the loading of [JavaFX] relevant files (model, view, controller, .fxml, .css, .properties) and enables …☆19Oct 13, 2020Updated 5 years ago
- 🔥🔥🔥Java代码实现调用RapidOCR(基于PaddleOCR),适配Mac、Win、Linux,支持最新PP-OCRv4☆556Jun 5, 2024Updated last year
- Mojuan: Write your own AI application.☆16Jul 12, 2024Updated last year
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆77Dec 7, 2025Updated 3 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios☆249Aug 20, 2025Updated 7 months ago