v-zich / couplet-clean-dataset
Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。
☆72Updated 5 years ago
Alternatives and similar repositories for couplet-clean-dataset:
Users that are interested in couplet-clean-dataset are comparing it to the libraries listed below
- 中文纠错☆92Updated 3 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆134Updated 4 years ago
- Pytorch model for https://github.com/imcaspar/gpt2-ml☆79Updated 3 years ago
- 各大中文分词性能评测☆157Updated 6 years ago
- 时间抽取、解析、标准化工具☆51Updated 2 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆194Updated 3 years ago
- 李傲龍的博客☆81Updated 8 months ago
- 基于bert进行中文文本纠错☆232Updated last year
- ☆34Updated 3 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆172Updated 6 years ago
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 6 years ago
- 大规模中文语料☆40Updated 5 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆74Updated 4 years ago
- 历届中文句法错误诊断技术评测数据集☆38Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆84Updated last month
- 中文近义词表 Chinese Synonyms☆252Updated 7 years ago
- ☆102Updated 4 years ago
- 中文心理问答数据集☆75Updated 4 years ago
- CCL 2023 汉语学习者文本纠错评测☆28Updated last year
- CCL 2022 汉语学习者文本纠错评测☆138Updated 2 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆107Updated last year
- 零样本学习测评基准,中文版☆56Updated 3 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆159Updated 3 years ago
- ☆218Updated 2 years ago
- Gaokao Benchmark for AI☆108Updated 2 years ago
- 中文生成式预训练模型☆98Updated 4 years ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- 中文版unilm预训练模型☆83Updated 4 years ago
- ☆46Updated 4 years ago