brightgems / china_city_dataset
中国城市数据集
☆78Updated 3 years ago
Alternatives and similar repositories for china_city_dataset:
Users that are interested in china_city_dataset are comparing it to the libraries listed below
- 图书名语料库。含部分电影、游戏名称。☆71Updated last year
- 中文分词工具评估☆61Updated 2 years ago
- 常用中文停用词表及对比☆69Updated 6 years ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆69Updated last year
- 转换搜狗拼音词库为txt文件☆50Updated 7 years ago
- 下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库☆114Updated 7 years ago
- worddict crawler and transfer for sougpuinput wordict , 搜狗输入法词库抓取与格式转换☆25Updated 7 years ago
- 夸夸语料,来自豆瓣互相表扬组数据☆75Updated 6 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆195Updated 3 years ago
- 书籍《现代自然语言生成》介绍☆218Updated 4 years ago
- [译] Python 自然语言处理 第二版☆70Updated 4 years ago
- self complemented BaiduIndexSpyder based on Selenium , index image decode and num image transfer,基于关键词的历时百度搜索指数自动采集☆41Updated 6 years ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 7 years ago
- ☆37Updated 5 years ago
- 一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a…☆150Updated 6 months ago
- 成语接龙☆48Updated 8 months ago
- 中文预处理语料☆108Updated 6 years ago
- 新词发现算法(NewWordDetection)☆62Updated 7 years ago
- 中文地址提取工具,支持中国三级区划地址(省、市、区)提取和映射,支持地址热力图绘制。☆222Updated 5 months ago
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口☆59Updated 6 years ago
- 对红楼梦的各回目进行分类☆36Updated 7 years ago
- self complemented WeiboIndexSpyder based on Selenium ,新浪微博指数(微指数)采集,包括综合指数,移动端指数,PC端指数☆31Updated 6 years ago
- 中国各种公开的数据, 申万行业分类, 国民经济行业分类, 中国行政编码数据, 申银万国行业分类标准☆105Updated 6 years ago
- FastText 中文文档☆61Updated 4 years ago
- self implement of NLP toolkit 个人实现NLP汉语自然语言处理组件,提供基于HMM与CRF的分词,词性标注,命名实体识别接口,提供基于CRF的依存句法接口。☆55Updated 7 years ago
- 搜狐算法大赛:主要实体词情绪识别 baseline☆106Updated 6 years ago
- 中国法研杯-司法人工智能挑战赛☆91Updated 6 years ago
- 新冠期间,Springer Nature为教育界和学术界人士免费提供基础教科书的分类下载器☆9Updated 5 years ago
- 各大中文分词性能评测☆157Updated 6 years ago