TW-NLP / ChineseErrorCorrector
中文拼写错误和语法错误纠正
☆133Updated last week
Alternatives and similar repositories for ChineseErrorCorrector:
Users that are interested in ChineseErrorCorrector are comparing it to the libraries listed below
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆87Updated 2 months ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆539Updated last year
- 360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute☆281Updated 8 months ago
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆300Updated 9 months ago
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆223Updated last year
- 该项目是为了使用layoutlmv3针对中文图片训练和推理。 其中主要解决三个问题: 1.数据标准化成可以的训练数据集格式 2.layoutlmv3-base-chinese 分词修改 2.超过512长度的文本切分和滑窗操作☆47Updated 8 months ago
- "桃李“: 国际中文教育大模型☆178Updated last year
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆294Updated 2 years ago
- text correction papers☆306Updated last year
- Baichuan-13B 指令微调☆90Updated last year
- ☆323Updated 10 months ago
- kbqa,langchain,large langauge model, chatgpt☆80Updated 6 months ago
- Analysis of Chinese and English layouts 中英文版面分析☆207Updated last month
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated 10 months ago
- 一个简单快速的分词、命名实体识别工具☆582Updated last month
- 对深度学习中的NLP进行解释和代码使用☆50Updated last year
- code for piccolo embedding model from SenseTime☆123Updated 11 months ago
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆68Updated 2 months ago
- 文本去重☆71Updated 11 months ago
- 活字通用大模型☆388Updated 8 months ago
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆32Updated 10 months ago
- ChatGLM-6B fine-tuning.☆135Updated 2 years ago
- The code and data for GrammarGPT.☆169Updated last year
- 大模型预训练中文语料清洗及质量评估 Large model pre-training corpus cleaning☆57Updated 9 months ago
- text embedding☆145Updated last year
- baichuan LLM surpervised finetune by lora☆63Updated last year
- 中文世界的NLP自动标注开源工具,简单样本,交给LabelFast。☆70Updated 4 months ago
- transformers结构的中文OFA模型☆130Updated 2 years ago
- 语言模型中文认知能力分析☆237Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆107Updated last year