rime-aca / corpus
古典中文語料庫
☆285Updated 2 years ago
Alternatives and similar repositories for corpus:
Users that are interested in corpus are comparing it to the libraries listed below
- 漢語拆字字典☆761Updated 2 years ago
- 汉语古典文本资料库☆270Updated 7 years ago
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆518Updated 3 years ago
- 微信公众号语料库☆579Updated 6 years ago
- 《现代汉语词典》(第7版)全文TXT☆262Updated 8 months ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆364Updated 4 months ago
- THUOCL(THU Open Chinese Lexicon)中文词库☆901Updated last year
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.☆535Updated 10 months ago
- NLU is hard!!!☆272Updated 5 years ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆153Updated this week
- 中文相关词典和语料库。☆172Updated 10 years ago
- 古诗词语料库☆128Updated 7 years ago
- ☆57Updated 7 years ago
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- 词语拼音数据☆472Updated last month
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆499Updated 4 years ago
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…☆607Updated 3 years ago
- 单手笔顺输入法码表 Code table for Chinese stroke sequence (one hand) input method☆97Updated 8 months ago
- 汉字自动拆分系统开发☆102Updated last year
- Scrape poetry from gushiwen.org☆40Updated 8 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆183Updated 4 years ago
- [本项目不再维护] 将汉字转换为拼音, 支持多音字,拼音 -> pin yin☆210Updated last year
- OpenCC made with Python☆548Updated last year
- Some useful Chinese corpus datasets 中文语料小数据☆532Updated 4 years ago
- 字词:收集国学/汉语字词拼音相关资源☆29Updated 6 years ago
- Dataset for couplets. 70万条对联数据库。☆728Updated last month
- 《汉语大字典》字头检索表☆18Updated 2 years ago
- 下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库☆112Updated 7 years ago
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆452Updated 11 months ago
- this repo is a DB for Ancient Chinese Poems and Ancient Chinese Rhyme (Pronunciation).☆96Updated 10 years ago