liangqi / chinese-frequency-word-list
☆28Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for chinese-frequency-word-list
- Hanzi Converter for Traditional and Simplified Chinese☆180Updated 4 years ago
- 《现代汉语大词典》字词头☆26Updated 3 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆35Updated last year
- 《汉语大字典》字头检索表☆17Updated last year
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆74Updated 8 years ago
- ☆52Updated 7 years ago
- 中文繁体和简体字符对照表☆37Updated 2 years ago
- 《现代汉语词典》(第7版)全文TXT☆246Updated 4 months ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆92Updated 4 years ago
- Han character library for CJKV languages☆150Updated 3 years ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆329Updated 3 weeks ago
- 漢語拆字字典☆733Updated last year
- Python module that identifies Chinese text as being Simplified or Traditional☆86Updated last year
- Constants used in Chinese text processing☆359Updated last year
- 使用 pinyin-data 和 phrase-pinyin-data 中的拼音数据文件覆盖 pypinyin 中的内置拼音数据☆44Updated 8 months ago
- 開放漢語字典 - 現代漢語字音數據庫☆21Updated 4 years ago
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆142Updated 6 months ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- 词语拼音数据☆448Updated 8 months ago
- Somiao Pinyin: Train your own Chinese Input Method with Seq2seq Model 搜喵拼音输入法☆266Updated 4 years ago
- [本项目不再维护] 将汉字转换为拼音, 支持多音字,拼音 -> pin yin☆206Updated last year
- colordict词典库☆83Updated 10 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆85Updated 3 years ago
- Utility scripts or libraries for various Natural Language Processing tasks.☆39Updated 2 years ago
- ☆42Updated 2 years ago
- 收集非普通話漢語和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects☆187Updated this week
- A small package to fuzzy match chinese words☆78Updated last year
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆23Updated 6 years ago
- A tool for ancient Chinese segmentation.☆53Updated 5 years ago
- Chinese word segmentation module of LTP☆46Updated 9 years ago