Based on RapidOCR, extract the PDF content
☆185May 7, 2025Updated 9 months ago
Alternatives and similar repositories for RapidOCRPDF
Users that are interested in RapidOCRPDF are comparing it to the libraries listed below
Sorting:
- 📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.☆5,980Feb 13, 2026Updated 2 weeks ago
- 文档方向分类☆222Feb 3, 2026Updated 3 weeks ago
- rapidocr onnx cpp☆323Mar 25, 2025Updated 11 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆205Nov 1, 2024Updated last year
- Analysis of Chinese and English layouts 中英文版面分析☆267Updated this week
- ☆22Nov 15, 2024Updated last year
- ☆12Jun 28, 2024Updated last year
- 基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。☆410Sep 4, 2025Updated 5 months ago
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆924Aug 3, 2025Updated 6 months ago
- Full text search engine powered by LotusDB.☆21Jan 21, 2024Updated 2 years ago
- Real time faster whisper gradio☆25Aug 17, 2025Updated 6 months ago
- Useful resources for creating apps and working with flow.☆11Oct 28, 2024Updated last year
- RedNote MCP - Xiaohongshu Content Search Tool☆21Jun 26, 2025Updated 8 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68May 9, 2023Updated 2 years ago
- Automatic development for retrieval augmented generation system☆10Feb 2, 2025Updated last year
- Run a fleet of CLI agents to do tasks and research☆19Dec 11, 2025Updated 2 months ago
- The intelligent data query plugin under DataFocus that supports multi-round conversations provides plug-and-play ChatBI capabilities.☆14Apr 14, 2025Updated 10 months ago
- 修正文档扭曲/模糊/阴影等情况,使用onnx模型简单轻量部署,未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆95Dec 17, 2025Updated 2 months ago
- 大模型微调工具集合☆26Mar 15, 2024Updated last year
- 中文停用词汇总,持续完善中,欢迎push共建☆16Jun 12, 2023Updated 2 years ago
- Vscode Samge Translate 翻译助手:Quickly translate text right in your code 🚀 支持多种翻译命令(英译中、中译英、中文转多规则命名变量等),支持多种结果展示方式,支持配置百度、阿里、腾讯、火山、有道、Deep…☆14Mar 24, 2025Updated 11 months ago
- 基于 canvas 绘制的组织架构图 / 目录树☆13Apr 9, 2024Updated last year
- cnki,中国知网,论文,论文下载,摘要查询☆15Mar 31, 2020Updated 5 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32May 10, 2023Updated 2 years ago
- Tensorflow 版本的图片鉴黄。not suitable/safe for work (NSFW) images detection using Tensorflow☆11Apr 11, 2020Updated 5 years ago
- Mojuan: Write your own AI application.☆15Jul 12, 2024Updated last year
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆59Apr 28, 2023Updated 2 years ago
- snowflake ruby impl☆13May 17, 2017Updated 8 years ago
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- Pytorch implementation of math equation images to latex markup language.☆30Oct 25, 2020Updated 5 years ago
- ☆15Jun 20, 2024Updated last year
- Use a LlamaIndex Agent as a backend service☆23May 3, 2024Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Dec 2, 2024Updated last year
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆26Updated this week
- Convert the model in PaddleOCR to ONNX format☆112Jul 15, 2025Updated 7 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆77Dec 7, 2025Updated 2 months ago
- TaskingAI Python Client☆21Jan 28, 2025Updated last year
- ☆17Mar 2, 2024Updated 2 years ago
- 策略基类/ 支持QIFI协议☆15Feb 16, 2021Updated 5 years ago