zejunwang1 / CTCDatasetLinks
中文文本纠错数据集汇总
☆17Updated this week
Alternatives and similar repositories for CTCDataset
Users that are interested in CTCDataset are comparing it to the libraries listed below
Sorting:
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- 音乐类语料的意图识别填槽以及槽值纠错模型☆16Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- This repository provides an implementation of the paper "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Co…☆70Updated 3 months ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆13Updated 2 years ago
- baichuan LLM surpervised finetune by lora☆63Updated 2 years ago
- LORA微调BLOOMZ,参考BELLE☆25Updated 2 years ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- flow mirror models from JZX AI Labs☆44Updated 8 months ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated last year
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- Rephrasing Language Model for CSC (AAAI 2024)☆41Updated last year
- 多轮共情对话模型PICA☆95Updated last year
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- 基于seq2edit (Gector) 的中文文本纠错。☆28Updated 2 years ago
- A repository for Chinese text normalization.☆16Updated 4 years ago
- ☆12Updated last year
- Finetune Bloom big language model with Lora method☆31Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆116Updated last month
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- 大语言模型训练和服务调研☆37Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆117Updated 6 months ago
- ☆37Updated last year
- ☆26Updated last year
- ☆11Updated 2 years ago
- dpo算法实现☆38Updated last year
- 高性能文本 Tokenizer 库☆29Updated last year
- baichuan and baichuan2 finetuning and alpaca finetuning☆32Updated 3 months ago