MGzhou / duguang-ocr-onnx
Code for using the ONNX version of the DuGuang OCR model for Chinese and English
☆49 Updated 5 months ago
Alternatives and similar repositories for duguang-ocr-onnx
Users who are interested in duguang-ocr-onnx are comparing it to the libraries listed below
- Research on accelerating GOT-OCR for production deployment, in any language ☆64 Updated last year
- PDF data: Chinese papers, securities documents, and financial reports ☆35 Updated last year
- Python3 package for Chinese/English OCR using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference based on the ppocr-v5-onnx model; open-source Chinese/English OCR… ☆106 Updated 3 months ago
- Document orientation classification ☆225 Updated 11 months ago
- Video understanding: Qwen multimodal video model & Dify ☆65 Updated last year
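- Image analysis web service built on lightweight models, including skew-corrected OCR, official seal (stamp) detection and recognition, and license plate recognition. The API uses FastAPI + Gunicorn, with a Gradio demo. ☆101 Updated last year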
- Image vector retrieval service supporting multiple compute engines: NumPy, Faiss, ES, and Milvus ☆137 Updated 2 years ago
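- This project uses layoutlmv3 for training and inference on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format; 2. modifying tokenization for layoutlmv3-base-chinese; 3. splitting and sliding-window handling of text longer than 512 ☆60 Updated last year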
- Comparison of LLM API performance metrics: in-depth analysis of key metrics such as TTFT and TPS ☆19 Updated last year
- Detect and extract table regions from images in various scenarios, and correct perspective and rotation i… ☆113 Updated 10 months ago
- ☆27 Updated last year
- Extracts PDF content, based on RapidOCR ☆181 Updated 5 months ago
- Chinese layout detection; yolov8 is used to detect the layout of Chinese document images. ☆58 Updated 2 years ago
- A demo PDF parser (including OCR and object detection tools) ☆36 Updated last year
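- Corrects document distortion, blur, shadows, and similar defects using ONNX models for simple, lightweight deployment, and will keep tracking the latest and best document-correction methods and models. Correct document distortion using a lightweight ONNX model for easy deployment. We wi… ☆85 Updated 10 months ago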
- Uses OCR recognition results to constrain the answers of multimodal LLMs in visual information extraction tasks ☆41 Updated 9 months ago
- Deploy Qwen2.5 with FastAPI + vLLM ☆24 Updated last year
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser ☆47 Updated last year
- Analysis of Chinese and English layouts ☆249 Updated 2 months ago
- Reading order, Layoutreader ☆19 Updated 5 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆29 Updated last year
- Detection and rectification of ID cards, certificates, and documents ☆74 Updated last year
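- Pure C++ cross-platform LLM acceleration library, callable from Python; supports baichuan, glm, llama, and moss base models; runs chatglm-6B-class models smoothly on mobile devices and reaches 10000+ tokens/s on a single card ☆45 Updated 2 years ago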
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model, with enhanced Chinese text recognition and meme-meaning understanding, approaching the recognition level of gpt4o and claude-3.5-sonnet! ☆25 Updated last year
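- PDF parsing tool: a vLLM-accelerated implementation of GOT, with MinerU for layout recognition and cropping and GOT for table and formula parsing, for PDF parsing in RAG ☆64 Updated 11 months ago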
- Style-Text data synthesis tool ☆67 Updated 10 months ago
- A data annotation tool (modeled on Baidu's online annotation platform) ☆13 Updated 4 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM ☆101 Updated last year
- One-stop automated open-source annotation platform ☆78 Updated 3 years ago
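- Document image processing tool, including bleaching / text orientation correction / sharpness enhancement / note denoising and beautification / shadow removal / distortion correction / edge-trim enhancement (DocBleach / TextOrientationCorrection / DocSha… ☆98 Updated last year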