anjuke / pinyin4py
汉字转拼音
☆44Updated 9 years ago
Alternatives and similar repositories for pinyin4py:
Users that are interested in pinyin4py are comparing it to the libraries listed below
- 一个中文无字典分词程序☆39Updated 6 years ago
- auto generate chinese words in huge text.☆91Updated 10 years ago
- A Python package for pullword.com☆86Updated 4 years ago
- 基于行块分布函数的通用网页正文(及图片)抽取 - Python版本☆115Updated 8 years ago
- 《基于行块分布函数的通用网页正文抽取》的Python实现方式☆30Updated 10 years ago
- 下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库☆112Updated 7 years ago
- 一个中文词库☆347Updated 10 years ago
- BosonNLP HTTP API 封装库(SDK)☆164Updated 6 years ago
- 正文提取|extract content from html☆22Updated 7 years ago
- deepThought is a conversational smart bot☆110Updated 8 years ago
- Thank-you-follow-me Ha Ha Ha!☆42Updated 9 years ago
- An OCR client use Baidu API☆54Updated 7 years ago
- SNS用户交互学习行为研究☆45Updated 10 years ago
- 搜狗细胞词库到普通文本的转换提取工具。提取词汇表,用于深度学习做数据生成和字典特征☆23Updated 6 years ago
- yet another python crawler☆31Updated 11 years ago
- yaha☆266Updated 6 years ago
- 中文相关词典和语料库。☆172Updated 10 years ago
- 对红楼梦的各回目进行分类☆36Updated 7 years ago
- 搜狗、百度、QQ输入法的词库文件的 Java 解析程序,配合 ThesaurusSpider 使用☆107Updated 5 years ago
- spark处理大规模语料库统计词频☆40Updated 8 years ago
- ⛔ [DEPRECATED] URL2io Python SDK,用于网页信息提取,如正文提取☆41Updated 4 years ago
- 把之前 hanLP-python-flask 裡面的 hanLP 單獨分出來☆59Updated 7 years ago
- a chinese segment base on crf☆233Updated 6 years ago
- A Chinese Webpage Title Text Categorization Tool 中文网页标题分类工具(短文本分类) pure c/c++ version: https://github.com/MagnusBai/webpage_categorizati…☆20Updated 7 years ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆65Updated 6 years ago
- 网页内容生成word cloud☆10Updated 7 years ago
- 图书名语料库。含部分电影、游戏名称。☆71Updated 11 months ago
- 自动修正中文、英文、代码混合排版中的全半角、空格等问题☆97Updated 3 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 7 years ago
- the Chinese NLP full stack toolkit☆41Updated 10 years ago