rime-aca / corpus
古典中文語料庫
☆274Updated 2 years ago
Related projects: ⓘ
- 漢語拆字字典☆724Updated last year
- 汉语古典文本资料库☆239Updated 6 years ago
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征☆323Updated last month
- 《现代汉语词典》(第7版)全文TXT☆234Updated 2 months ago
- 中文相关词典和语料库。☆168Updated 10 years ago
- GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical Chinese (Literary Chinese)☆488Updated 3 years ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆88Updated 4 years ago
- this repo is a DB for Ancient Chinese Poems and Ancient Chinese Rhyme (Pronunciation).☆92Updated 9 years ago
- 微信公众号语料库☆569Updated 5 years ago
- 诗歌分析程序☆241Updated 7 years ago
- THUOCL(THU Open Chinese Lexicon)中文词库☆837Updated last year
- Dataset for couplets. 70万条对联数据库。☆709Updated 6 years ago
- NLU is hard!!!☆267Updated 5 years ago
- 词语拼音数据☆438Updated 6 months ago
- ☆52Updated 7 years ago
- A Chinese sentiment dataset may be useful for sentiment analysis.☆228Updated 7 years ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- 甲言,专注于古代汉语(古汉语/古文/文言文/文言)处理的NLP工具包,支持文言词库构建、分词、词性标注、断句和标点。Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co…☆566Updated 2 years ago
- Hanzi Converter for Traditional and Simplified Chinese☆180Updated 4 years ago
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆498Updated 4 years ago
- 《汉语大字典》字头检索表☆16Updated last year
- 一个中文词库☆343Updated 10 years ago
- 汉字自动拆分系统开发☆101Updated 10 months ago
- 近代汉语语料库数据集 自然语言处理 语料库 古代汉语 古汉语 文言文 数字人文 计算语言☆145Updated last year
- Simple conversion and localization between simplified and traditional Chinese using tables from MediaWiki.☆520Updated 5 months ago
- a char-RNN based on pytorch☆239Updated 7 years ago
- 单手笔顺输入法码表 Code table for Chinese stroke sequence (one hand) input method☆94Updated 3 months ago
- 下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库☆113Updated 7 years ago
- 古诗词语料库☆120Updated 7 years ago