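This project trains and runs inference with layoutlmv3 on Chinese images. It addresses three main problems: 1. standardizing data into a usable training dataset format; 2. modifying the tokenization for layoutlmv3-base-chinese; 3. splitting text that exceeds the 512 length limit and applying a sliding window.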
☆63Sep 6, 2024Updated last year
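The third problem in the description above, splitting inputs longer than 512 and sliding a window over them, can be sketched as follows. This is a minimal illustration under stated assumptions, not the repository's actual code: the function name `sliding_windows` and the `stride` parameter (how many items consecutive windows share) are hypothetical, and a real pipeline would likely use a smaller `max_len` to leave room for special tokens.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def sliding_windows(items: Sequence[T], max_len: int = 512, stride: int = 128) -> List[List[T]]:
    """Split `items` into windows of at most `max_len` elements.

    Consecutive windows overlap by `stride` elements. Both the name and the
    default values are illustrative assumptions, not taken from the project.
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if not 0 <= stride < max_len:
        raise ValueError("stride must satisfy 0 <= stride < max_len")
    if len(items) == 0:
        return []

    step = max_len - stride  # how far the window start advances each time
    windows: List[List[T]] = []
    start = 0
    while True:
        windows.append(list(items[start:start + max_len]))
        if start + max_len >= len(items):  # this window reached the end
            break
        start += step
    return windows


# Hypothetical stand-in for a tokenized page of 1000 tokens.
tokens = list(range(1000))
chunks = sliding_windows(tokens, max_len=512, stride=128)
```

Calling the same function with the same `max_len` and `stride` on the parallel list of bounding boxes keeps each box aligned with its token in every window, which is the property a layout model needs.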
Alternatives and similar repositories for layoutlmv3-chinese
Users that are interested in layoutlmv3-chinese are comparing it to the libraries listed below.
- chinese document classification of layoutlmv3 and layoutxlm☆45Oct 25, 2022Updated 3 years ago
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- ☆69Sep 24, 2023Updated 2 years ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆17Apr 12, 2024Updated 2 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- LGPMA inference for table structure recognition☆25Nov 17, 2022Updated 3 years ago
- 360LayoutAnaylsis, a series of document analysis models and datasets developed by 360 AI Research Institute☆306Sep 10, 2024Updated last year
- General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser☆48Jun 13, 2024Updated last year
- Table Structure Recognition☆28Jul 25, 2024Updated last year
- CDLA: A Chinese document layout analysis (CDLA) dataset☆294Sep 13, 2021Updated 4 years ago
- ☆19Mar 10, 2023Updated 3 years ago
- ICDAR 2024 Table OCR Model☆39Feb 25, 2026Updated 3 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆43Oct 6, 2023Updated 2 years ago
- Fine-tunes the Qwen1.5-0.5B-Chat model on a general information extraction task, aiming to: verify how the generative approach compares with extractive NER; give beginners a simple fine-tuning workflow with as little code as possible; handle data formatting for large-model training.☆14Sep 6, 2024Updated last year
- TRACE: Table Reconstruction Aligned to Corner and Edges (ICDAR 2023)☆32Mar 13, 2024Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- DocTr++ in PaddlePaddle☆57Jul 24, 2024Updated last year
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Trains a small but capable formula recognition model based on TrOCR and the UniMER-1M dataset☆29Mar 17, 2026Updated 2 months ago
- ☆13May 28, 2025Updated last year
- Baidu Netdisk AI Competition, Image Processing Challenge: 2nd-place solution for document image moiré removal☆43Nov 28, 2023Updated 2 years ago
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆45Apr 21, 2026Updated last month
- ☆101Dec 23, 2024Updated last year
- Research on GOT-OCR: accelerating project deployment, not limited to any language☆62Oct 24, 2024Updated last year
- ☆19Feb 5, 2026Updated 3 months ago
- ☆14Jan 15, 2026Updated 4 months ago
- ☆33Dec 18, 2025Updated 5 months ago
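- Inspired by self-instruct: beyond general-purpose LLMs, small vertical-domain LLMs can also be built for customized results, using questions and answers obtained from GPT as training data☆18May 12, 2023Updated 3 years ago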
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆277Dec 6, 2025Updated 5 months ago
- ☆20Jun 21, 2024Updated last year
- Corrects document distortion, blur, shadows, and similar defects, using ONNX models for simple, lightweight deployment; will keep tracking the latest and best document rectification approaches and models. Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆102Dec 17, 2025Updated 5 months ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆126Nov 13, 2023Updated 2 years ago
- Analysis of Chinese and English layouts 中英文版面分析☆272Mar 24, 2026Updated 2 months ago
- [VISAPP 2022] MdVRNet: Deep Video Restoration under Multiple Distortions☆12Aug 7, 2024Updated last year
- Android: reads a hex file and downloads it to an STM32 microcontroller over Bluetooth☆11Nov 6, 2017Updated 8 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆320Aug 15, 2025Updated 9 months ago
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆21Mar 11, 2026Updated 2 months ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆228Sep 9, 2024Updated last year