rime-aca / corpus
古典中文語料庫
☆278Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for corpus
- 漢語拆字字典☆737Updated last year
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆508Updated 3 years ago
- 汉语古典文本资料库☆249Updated 6 years ago
- 《现代汉语词典》(第7版)全文TXT☆249Updated 5 months ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆335Updated last month
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- 词语拼音数据☆452Updated 8 months ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆146Updated last year
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆499Updated 4 years ago
- 微信公众号语料库☆573Updated 5 years ago
- 单手笔顺输入法码表 Code table for Chinese stroke sequence (one hand) input method☆96Updated 5 months ago
- THUOCL(THU Open Chinese Lexicon)中文词库☆858Updated last year
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- Some useful Chinese corpus datasets 中文语料小数据☆531Updated 4 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆181Updated 4 years ago
- NLU is hard!!!☆269Updated 5 years ago
- this repo is a DB for Ancient Chinese Poems and Ancient Chinese Rhyme (Pronunciation).☆92Updated 9 years ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆94Updated 4 years ago
- A Chinese sentiment dataset may be useful for sentiment analysis.☆230Updated 8 years ago
- OpenCC made with Python☆537Updated 11 months ago
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…