Spico197 / CatalogExtraction
🌳CED: Catalog Extraction from Documents
☆15Updated last year
Related projects: ⓘ
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆73Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆104Updated last month
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆41Updated 3 months ago
- ☆44Updated 9 months ago
- ☆57Updated last year
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆70Updated last year
- 历届中文句法错误诊断技术评测数据集☆33Updated 2 years ago
- Rephrasing Language Model for CSC (AAAI 2024)☆33Updated 4 months ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆79Updated 6 months ago
- ☆27Updated last year
- 基于模板的文本纠错;Automatically Mining Error Templates for Grammatical Error Correction☆36Updated 2 years ago
- CCL 2022 汉语学习者文本纠错评测☆133Updated last year
- ☆50Updated 6 months ago
- ☆54Updated 2 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆51Updated last month
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset".☆27Updated 3 months ago
- ☆56Updated last month
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆67Updated 3 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆45Updated last year
- LLM for NER☆47Updated last month
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆14Updated 6 months ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆144Updated last year
- Viscacha:通用信息抽取数据集收集☆22Updated 7 months ago
- 中文文本纠错相关的论文、比赛和工具。☆46Updated 2 months ago
- CCL 2023 汉语学习者文本纠错评测☆25Updated last year
- 中文bigbird预训练模型☆86Updated 2 years ago
- 基于seq2edit (Gector) 的中文文本纠错。☆26Updated last year
- CCL2022汉语学习者文本纠错评测任务赛道二——CGED-8第一名解决方案☆52Updated last year
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆29Updated last year
- chinese document classification of layoutlmv3 and layoutxlm☆38Updated last year