RapidAI / RapidOrientation
Document orientation classification
☆202Updated this week
Related projects
Alternatives and complementary repositories for RapidOrientation
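- A collection of the best open-source table recognition models currently available, with improved pre- and post-processing and models converted to ONNX☆272Updated this week
- 360LayoutAnalysis, a series of document analysis models and datasets developed by 360 AI Research Institute☆236Updated 2 months ago
- Generates table images by rendering them in a browser☆202Updated 7 months ago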
- CDLA: A Chinese document layout analysis (CDLA) dataset☆250Updated 3 years ago
- Chinese and English layout analysis☆121Updated 3 weeks ago
- Chinese layout detection: YOLOv8 is used to detect the layout of Chinese document images.☆56Updated last year
- A high-efficiency open-source toolkit for the table-to-LaTeX task☆143Updated last week
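- [Gap-tree-sort algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it in human reading order.☆105Updated 8 months ago
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆45Updated 4 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆55Updated last week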
- Extracts PDF content using RapidOCR.☆132Updated 2 months ago
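- Transformer-based OCR, extended to official seal recognition☆152Updated 4 months ago
- Table recognition algorithm derived from PP-Structure, with the model converted to ONNX and ONNXRuntime as the inference engine; simple to deploy, with no memory leaks.☆39Updated this week
- Image analysis web service built on lightweight models, including skew-corrected OCR, official seal detection and recognition, and license plate recognition. The API uses FastAPI + Gunicorn, with a gradio demo.☆78Updated 6 months ago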
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆134Updated 2 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆90Updated 5 months ago
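- Research on accelerating GOT-OCR for production deployment, not limited to any language☆48Updated 2 weeks ago
- PDF parsing (text, sections, tables, images, references); PDF question answering, summarization, and information extraction based on large language models (ChatGLM2-6B, RWKV) + langchain + streamlit☆154Updated last year
- Chinese Mathematical Formula Detection (MFD) Dataset for Chinese documents☆29Updated last year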
- Transformers OCR for Chinese☆358Updated last year
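- YAYI information extraction large model: instruction-tuned on millions of manually constructed, high-quality information extraction samples, developed by the algorithm team at Zhongke Wenge (中科闻歌). (Repo for YAYI Unified Information Extraction Model)☆270Updated 3 months ago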
- Table detection (YOLO), table line detection (U-Net)☆236Updated last year
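- A native Chinese benchmark for evaluating retrieval-augmented generation☆99Updated 6 months ago
- gpt_server is an open-source framework for production-grade deployment of LLMs or embedding models.☆121Updated last week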
- DocTr++ in PaddlePaddle☆41Updated 3 months ago
- A carefully designed OCR pipeline for universal bordered table recognition and reconstruction.☆169Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆193Updated 3 weeks ago
- Chinese document classification with LayoutLMv3 and LayoutXLM☆41Updated 2 years ago
- Notes on knowledge and methods related to large models☆102Updated last week
- ☆156Updated 8 months ago