sfyc23 / python-wubi
汉字五笔转换工具
☆31Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for python-wubi
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 6 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆79Updated 2 years ago
- 利用文本分析算法和Python脚本,自动纠正word中的英语单词拼写错误☆46Updated 6 years ago
- 中文「四角号码」数据与工具,可以将汉字拆解成和字形相关的编码,在机器学习中作为汉字的字形特征☆25Updated 4 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆84Updated 6 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆30Updated 4 years ago
- 转换搜狗拼音词库为txt文件☆50Updated 7 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆189Updated 3 years ago
- 提取中文的偏旁部首和拼音(一些生僻字的拼音没有补全,待优化)☆40Updated 6 years ago
- 图书名语料库。含部分电影、游戏名称。☆66Updated 7 months ago
- colordict词典库☆83Updated 10 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆128Updated 4 years ago
- 物种名称语料库。植物名,动物名。☆41Updated 7 months ago
- 百度百科爬虫☆68Updated 5 months ago
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口☆58Updated 6 years ago
- Self complemented Word Collocation using MI method which is tested to be effective..基于互信息算法的词语搭配抽取☆29Updated 6 years ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- ☆37Updated 5 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆43Updated 9 years ago
- Translation model based on sequence to sequence model. 基于seq2seq模型的翻译模型demo☆17Updated 6 years ago
- 人民日报1998年1-4月中文标注语料库☆29Updated 6 years ago
- Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(fu…☆24Updated 3 years ago
- 《现代汉语大词典》字词头☆26Updated 3 years ago
- 中文分词工具评估☆59Updated last year
- 搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征☆23Updated 5 years ago
- 汉字自动拆分系统开发☆102Updated last year
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆63Updated 6 years ago
- 从门户网站爬取新闻的摘要-标题对使用seq2seq根据摘要生成标题☆45Updated 7 years ago