General layout analysis | Chinese document parsing | Document Layout Analysis | layout parser
☆48Jun 13, 2024Updated last year
Alternatives and similar repositories for General-Documents-Layout-parser
Users that are interested in General-Documents-Layout-parser are comparing it to the libraries listed below
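- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by Hehe (合合) recognition as the initial training data, so that the model better fits the specific scenario of 10,000 images and text recognition accuracy improves.☆10Dec 9, 2024Updated last year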
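- This project trains and runs inference with layoutlmv3 on Chinese images. It mainly addresses three problems: 1. standardizing data into a usable training dataset format 2. modifying the layoutlmv3-base-chinese tokenization 3. splitting text longer than 512 and applying a sliding window☆63Sep 6, 2024Updated last year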
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Apr 21, 2025Updated 10 months ago
- Chinese document classification with layoutlmv3 and layoutxlm☆46Oct 25, 2022Updated 3 years ago
- ☆29Feb 27, 2025Updated last year
- An e-commerce LLM built by fine-tuning (SFT) the Qwen2.5 series on e-commerce data. It is an upgraded version of https://github.com/leeguandong/EcommerceLLM. qwen2.5 performs very well.☆13Oct 4, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 9 months ago
- An easily extensible framework for understanding and optimizing Cuda operators, intended for learning purposes only☆18Jun 13, 2024Updated last year
- Yet Another Papers With Code☆35Sep 7, 2025Updated 5 months ago
- [ICME'23, oral] CCLAP: Controllable Chinese Landscape Painting Generation☆19Apr 20, 2025Updated 10 months ago
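- A hybrid-inference extension plugin for vllm that supports multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill☆31Nov 7, 2025Updated 3 months ago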
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- Batch-process data with LLMs; currently supports OCR with various LLMs, including Tongyi Qianwen, Moonshot AI, Baidu PaddleOCR, OpenAI, and LLAVA. Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆16Sep 15, 2024Updated last year
- ☆11Updated this week
- TianGong-AI-Unstructure☆71Feb 4, 2026Updated 3 weeks ago
- SwanLab Local Visualization Python Package Plugin☆24Feb 11, 2026Updated 2 weeks ago
- ☆21Dec 24, 2024Updated last year
- ☆18Feb 5, 2026Updated 3 weeks ago
- Rephrasing Language Model for CSC (AAAI 2024)☆44May 14, 2024Updated last year
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆127Jun 4, 2025Updated 8 months ago
- ☆47Jul 19, 2022Updated 3 years ago
- E-commerce LLMs built by fine-tuning the Qwen1.5 series on e-commerce data, including 0.5b-base, 0.5b-chat, 1.8b-base, 7b-base, and an e-commerce LLM produced by SFT on a llama3-chinese-sft base model.☆22May 14, 2024Updated last year
- Table Structure Recognition☆28Jul 25, 2024Updated last year
- Deepseek-r1 reproduction: an explainer and resource roundup☆22Mar 5, 2025Updated 11 months ago
- A graph rag for PDFs based on langchain and Neo4j. Can fetch PDFs from Zotero Library through zotero api.☆29Jun 26, 2024Updated last year
- ☆97Jul 12, 2022Updated 3 years ago
- ☆156May 8, 2025Updated 9 months ago
- Deploying Qwen2.5 with FastAPI + vLLM☆26Sep 29, 2024Updated last year
- Uses Swin-Unet (Swin Transformer Unet) to recognize table structure in document images☆28Feb 23, 2024Updated 2 years ago
- [Gap · Tree · Sorting algorithm] Performs layout analysis on OCR results or text extracted from PDFs and sorts it into human reading order.☆163Feb 28, 2024Updated 2 years ago
- ☆24May 21, 2025Updated 9 months ago
- ☆26May 11, 2025Updated 9 months ago
- Group chat summarization for DOW☆28Feb 27, 2025Updated last year
- A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using …☆56Nov 14, 2025Updated 3 months ago
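- A roundup of human-machine dialogue evaluation tasks (DSTC, ConAI2, SMP-ECDT, JD DC), focusing on the tasks and solutions of the Chinese human-machine dialogue evaluation (SMP-ECDT).☆25Apr 13, 2021Updated 4 years ago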
- benchmark of KgCLUE, with different models and methods☆28Dec 13, 2021Updated 4 years ago
- Document orientation classification☆222Feb 3, 2026Updated 3 weeks ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Dec 12, 2024Updated last year
- ☆24Oct 8, 2021Updated 4 years ago