jaaack-wang / Chinese-Synonyms
A large, high-quality corpus of Chinese synonyms.
☆64 Updated 3 years ago
Alternatives and similar repositories for Chinese-Synonyms
Users who are interested in Chinese-Synonyms are comparing it to the repositories listed below.
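- Datasets from past editions of the Chinese grammatical error diagnosis evaluation ☆43 Updated 3 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model ☆119 Updated 10 months ago
- Parallel corpus of Classical Chinese and Modern Chinese translations ☆110 Updated 3 years ago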
- 渊 - A project for Classical Chinese ☆109 Updated 3 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings) ☆95 Updated 8 months ago
- Yet Another Chinese Learner Corpus ☆77 Updated 3 years ago
- Chinese Couplets Dataset without vulgar words. ☆77 Updated 5 years ago
- Chinese machine reading comprehension dataset ☆107 Updated 4 years ago
- CCL 2023 Chinese learner text correction evaluation ☆29 Updated 2 years ago
- CINO: Pre-trained Language Models for Chinese Minority Languages ☆254 Updated 3 months ago
- ☆59 Updated 4 years ago
- Large-scale Chinese corpus ☆44 Updated 5 years ago
- Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard ☆54 Updated 2 years ago
- CLUEWSC2020: Chinese version of the Winograd Schema Challenge (WSC), a Chinese coreference resolution task ☆78 Updated 5 years ago
- Tool for extracting, parsing, and normalizing time expressions ☆55 Updated 3 years ago
- Chinese error correction ☆93 Updated 3 years ago
- LERT: A Linguistically-motivated Pre-trained Language Model ☆219 Updated 3 months ago
- CCL 2022 Chinese learner text correction evaluation ☆142 Updated 2 years ago
- Code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers" ☆76 Updated last year
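- Chinese, word segmentation, word lists, core dictionary, event word list, stop words, sensitive words, question answering, QA data, knowledge graph, text corpora. ☆171 Updated 4 years ago
- Performance evaluation of major Chinese word segmenters ☆157 Updated 6 years ago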
- PERT: Pre-training BERT with Permuted Language Model ☆366 Updated 3 months ago
- GuwenModels: A collection of Classical Chinese natural language processing models, including Classical Ch… ☆190 Updated last year
- Zero-shot NLU & NLG based on the mengzi-t5-base-mt pretrained model ☆76 Updated 3 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet" ☆14 Updated 4 years ago
- ChineseTextualInference: a Chinese textual inference project covering the translation and construction of an 880,000-pair Chinese textual entailment dataset and a deep-learning-based textual entailment classification model… ☆175 Updated 6 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser" ☆85 Updated last year
- BERT-CCPoem is a BERT-based pre-trained model built specifically for Chinese classical poetry ☆158 Updated 3 years ago
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset". ☆33 Updated last year
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor… ☆75 Updated 3 years ago