Chinese character decomposition dictionary (漢語拆字字典)
☆811 · Jan 8, 2023 · Updated 3 years ago
Alternatives and similar repositories for chaizi
Users that are interested in chaizi are comparing it to the libraries listed below
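- Hanzi decomposition library that breaks Chinese characters down into radicals and components, for use as glyph features in machine learning | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components… ☆415 · Dec 29, 2025 · Updated 2 months ago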
- This is a corpus of Chinese abbreviations, including negative full forms. ☆199 · Jul 17, 2021 · Updated 4 years ago
- Chinese character featurizer that extracts character features (pronunciation and glyph features) for use in deep learning | A Chinese character feature extractor, which extracts the features of Chinese charac… ☆298 · Dec 29, 2025 · Updated 2 months ago
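- Compares the pronunciation and shape of 6,700 commonly used Chinese characters and outputs lists of similar-sounding and similar-looking characters. # Similar characters (相近字) ☆482 · Mar 28, 2024 · Updated last year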
- Development of an automatic Chinese character decomposition system ☆103 · Nov 2, 2023 · Updated 2 years ago
- An absolutely fun Chinese pronunciation engine | funny Chinese text-to-speech engine ☆52 · Sep 4, 2013 · Updated 12 years ago
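- Chinese personal name corpus and name generator: Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition. ☆4,272 · Nov 9, 2025 · Updated 4 months ago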
- Studies the structure of all Chinese characters to provide a complete solution to character-structure problems in NLP. ☆19 · Apr 7, 2024 · Updated last year
- IDS data for CJK Unified Ideographs ☆486 · Feb 24, 2023 · Updated 3 years ago
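- Chinese character feature extraction tool that extracts pronunciation (initial, final, tone), glyph (components, radical), four-corner code, and other features, which can also be fed to a model as tensors ☆137 · May 25, 2020 · Updated 5 years ago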
- Pinyin data for words and phrases ☆516 · Jul 20, 2025 · Updated 8 months ago
- Synonym, antonym, and negation word lists ☆542 · Oct 17, 2024 · Updated last year
- Pinyin data for Chinese characters ☆1,441 · Feb 23, 2026 · Updated last month
- Free, open-source Chinese character data ☆2,420 · Mar 8, 2026 · Updated 2 weeks ago
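- 2019 SOTA spell checker for Simplified and Traditional Chinese: FASPell Chinese Spell Checker (Chinese Spell Check / Chinese spelling error detection / Chinese spelling correction / Chinese spell checking) ☆1,225 · Sep 3, 2022 · Updated 3 years ago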
- Custom Chinese input method with fcitx on Linux ☆12 · Jul 16, 2020 · Updated 5 years ago
- Hanyu Pinyin conversion table ☆42 · Mar 24, 2021 · Updated 4 years ago
- Chinese character to pinyin conversion (pypinyin) ☆5,271 · Mar 8, 2026 · Updated 2 weeks ago
- A Slot-filling based Dialog Manager for Task-oriented Bot ☆12 · Dec 29, 2016 · Updated 9 years ago
- WordForm: an interface for stroke decomposition, radical lookup, and pinyin conversion of Chinese words ☆66 · Aug 26, 2018 · Updated 7 years ago
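- Chinese Xinhua Dictionary database, including xiehouyu (two-part allegorical sayings), idioms (chengyu), words, and characters. ☆11,502 · Dec 26, 2023 · Updated 2 years ago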
- Chinese NLP datasets, material for everyday experiments. Contributions and pull requests are welcome. ☆4,581 · Nov 21, 2023 · Updated 2 years ago
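- The hanzi similar tool. (A tool for computing Chinese character similarity, with an algorithm for visually similar characters. Can be used for handwritten character recognition correction, text obfuscation, and more.) ☆290 · Feb 28, 2024 · Updated 2 years ago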
- Some useful Chinese corpus datasets (small Chinese corpus data) ☆546 · Mar 29, 2020 · Updated 5 years ago
- 100+ Chinese Word Vectors (over a hundred sets of pre-trained Chinese word vectors) ☆12,188 · Oct 30, 2023 · Updated 2 years ago
- mirror of dongxiexidian/Chinese ☆304 · Dec 18, 2018 · Updated 7 years ago
- Chinese character stroke library ☆87 · Jan 8, 2021 · Updated 5 years ago
- WeChat official account corpus ☆591 · Jan 7, 2019 · Updated 7 years ago
- Code for ACL 2021 paper. MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition. ☆68 · Nov 4, 2021 · Updated 4 years ago
- Large Scale Chinese Corpus for NLP ☆9,872 · Feb 6, 2026 · Updated last month
- Classical Chinese (文言文) dictionary: scrapes a Classical Chinese dictionary website to build a Kindle dictionary. ☆68 · Jun 21, 2018 · Updated 7 years ago
- SentiBridge: A Knowledge Base for Entity-Sentiment Representation ☆644 · Sep 20, 2018 · Updated 7 years ago
- Chinese character dataset with related information such as stroke count, radical, pinyin, and English definitions/synonyms. ☆129 · Jul 17, 2020 · Updated 5 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT) ☆175 · Mar 26, 2019 · Updated 6 years ago
- Chinese character analysis tool ☆18 · Aug 20, 2018 · Updated 7 years ago
- THUOCL (THU Open Chinese Lexicon), a Chinese lexicon ☆1,031 · Apr 3, 2023 · Updated 2 years ago
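- Jiayan (甲言), an NLP toolkit focused on processing Classical Chinese (古汉语/古文/文言文/文言), supporting Classical Chinese lexicon construction, word segmentation, part-of-speech tagging, sentence segmentation, and punctuation. Jiayan, the 1st NLP toolkit designed for Classical Chinese, supports lexicon co… ☆660 · Nov 2, 2021 · Updated 4 years ago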
- Chinese synonyms: a toolkit for chatbots and intelligent question answering ☆5,103 · Feb 1, 2026 · Updated last month
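- Chinese Hanyu Pinyin dictionaries: character pinyin dictionary, word dictionary, idiom dictionary, and a dictionary database of common and polyphonic characters ☆738 · Feb 4, 2025 · Updated last year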