auto generate chinese words in huge text.
☆92Nov 25, 2014Updated 11 years ago
Alternatives and similar repositories for wordmaker
Users that are interested in wordmaker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- yaha☆265Sep 13, 2018Updated 7 years ago
- auto generate chinese words in huge text.☆23Jul 29, 2014Updated 11 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆95Oct 15, 2016Updated 9 years ago
- 新词发现算法(NewWordDetection)☆92Mar 22, 2021Updated 5 years ago
- 自动构建中文词库:http://www.matrix67.com/blog/archives/5044☆656Dec 5, 2023Updated 2 years ago
- wrap cppjieba by swig.☆20Mar 15, 2018Updated 8 years ago
- ☆16Nov 8, 2023Updated 2 years ago
- Simhash and near-duplicate detection☆17Dec 6, 2013Updated 12 years ago
- The simple header file library of CppJieba☆41Jun 7, 2015Updated 10 years ago
- spark处理大规模语料库统计词频☆41Apr 6, 2016Updated 9 years ago
- 农业知识图谱(KG):农业领域的信息检索,命名实体识别,关系抽取,分类树构建,数据挖掘☆14Apr 23, 2018Updated 7 years ago
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆500Sep 3, 2020Updated 5 years ago
- Build and visualize the word2vec model on sogou news data(SogouCS)☆13Mar 3, 2018Updated 8 years ago
- compare embedding☆238Sep 23, 2015Updated 10 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Sep 18, 2020Updated 5 years ago
- Chinese processing☆36Jan 29, 2014Updated 12 years ago
- 新词发现☆66May 30, 2014Updated 11 years ago
- 微博情感分析☆12Sep 1, 2013Updated 12 years ago
- Implementation of paper: Deng K, Bol P K, Li K J, et al. On the unsupervised analysis of domain-specific Chinese texts[J]. Proceedings of…☆78Aug 5, 2016Updated 9 years ago
- 基于TextRank和WordNet的中英文单文档自动摘要☆63Dec 18, 2015Updated 10 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆20May 9, 2020Updated 5 years ago
- 依存关系分析,NLP,自然语言处理☆85Oct 15, 2021Updated 4 years ago
- Chinese new word discovery☆43Aug 30, 2024Updated last year
- SWIG Wrapper for the SRILM toolkit☆35Oct 5, 2020Updated 5 years ago
- minitools☆104Jul 25, 2013Updated 12 years ago
- a demo site for jieba☆111Jul 29, 2013Updated 12 years ago
- 基于深度学习的中文分词尝试☆84Aug 27, 2015Updated 10 years ago
- hmm是实现命名实体识别,python 实现,对2014的人民日报语料进行按字切分,统计初始、转换、发射概率☆16Oct 19, 2017Updated 8 years ago
- Cross-platform process dependency monitor with GUI☆10Feb 23, 2026Updated last month
- solution for the 5th place of cikm cup 2014☆19Jan 28, 2015Updated 11 years ago
- ☆12Mar 1, 2019Updated 7 years ago
- 无字典中文关键字提取法☆11Nov 29, 2019Updated 6 years ago
- sequence labeling by neural network☆17Jun 10, 2017Updated 8 years ago
- 一个中文无字典分词程序☆42Sep 13, 2018Updated 7 years ago
- ☆15Sep 6, 2017Updated 8 years ago
- Evaluation of parallel regular expression matching on GPU☆15May 18, 2017Updated 8 years ago
- A Simpler GloVe model for distributed word representation☆86Aug 18, 2021Updated 4 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Jan 26, 2014Updated 12 years ago
- Code for "Smaller Text Classifiers with Discriminative Cluster Embeddings" (NAACL 2018)☆29Apr 16, 2018Updated 7 years ago