roliygu / CNKICrawler
A crawler of CNKI. It collects data for NLP and other ML/DL experiment.
☆33Updated 7 years ago
Alternatives and similar repositories for CNKICrawler:
Users that are interested in CNKICrawler are comparing it to the libraries listed below
- Get Data Reused☆20Updated 7 years ago
- Chinese Word Similarity Computation based on HowNet☆27Updated 7 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- Deep Learning for NLP resources☆17Updated 9 years ago
- Some articles written by Bao JieUpdated 8 years ago
- 微博搜索结果爬取工具☆27Updated 10 years ago
- 把之前 hanLP-python-flask 裡面的 hanLP 單獨分出來☆59Updated 7 years ago
- A command-line manager programed in python, help with managing your local academic papers.☆89Updated 6 years ago
- A distributed Sina Weibo Search spider base on Scrapy and Redis.☆146Updated 11 years ago
- Topical Word Embeddings☆55Updated 7 years ago
- Sample of using proxies to crawl baidu search results.☆118Updated 7 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 7 years ago
- A Python package for pullword.com☆86Updated 4 years ago
- 基于深度学习的中文分词尝试☆84Updated 9 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Pure python NLP toolkit☆55Updated 9 years ago
- 《知网》中文词语语义相似度算法☆41Updated 11 years ago
- Using HashData to analyze a series of public available data.☆13Updated 8 years ago
- ☆43Updated 8 years ago
- 微博主题搜索分析,上海租房☆115Updated 8 years ago
- 2016CCF-sougou-code&PPT☆55Updated 8 years ago
- Different approaches to computing document similarity☆28Updated 8 years ago
- Generating Songci (Poetry of the Song Dynasty) by machine. Based on QuanSongci and RNN.☆36Updated 8 years ago
- This open source project is a python wrapper for NLPIR.☆82Updated 9 years ago
- a chinese segment base on crf☆233Updated 6 years ago
- sina weibo capture and sentiment classification☆53Updated 8 years ago
- A Wechat Bot - 废弃项目☆29Updated 3 years ago
- 下载搜狗、百度、QQ输入法的词库文件的 python 爬虫,可用于构建不同行业的词汇库☆113Updated 7 years ago
- BosonNLP HTTP API 封装库(SDK)☆164Updated 6 years ago
- ☆20Updated 8 years ago