auto generate chinese words in huge text.
☆92Nov 25, 2014Updated 11 years ago
Alternatives and similar repositories for wordmaker
Users that are interested in wordmaker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- yaha☆265Sep 13, 2018Updated 7 years ago
- auto generate chinese words in huge text.☆23Jul 29, 2014Updated 11 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆96Oct 15, 2016Updated 9 years ago
- 新词发现算法(NewWordDetection)☆91Mar 22, 2021Updated 5 years ago
- 自动构建中文词库:http://www.matrix67.com/blog/archives/5044☆657Dec 5, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- wrap cppjieba by swig.☆20Mar 15, 2018Updated 8 years ago
- Simhash and near-duplicate detection☆17Dec 6, 2013Updated 12 years ago
- The simple header file library of CppJieba☆41Jun 8, 2026Updated last week
- spark处理大规模语料库统计词频☆41Apr 6, 2016Updated 10 years ago
- 农业知识图谱(KG):农业领域的信息检索,命名实体识别,关系抽取,分类树构建,数据挖掘☆14Apr 23, 2018Updated 8 years ago
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆499Sep 3, 2020Updated 5 years ago
- Build and visualize the word2vec model on sogou news data(SogouCS)☆13Mar 3, 2018Updated 8 years ago
- compare embedding☆238Sep 23, 2015Updated 10 years ago
- Hacky implementation of ppjoin by Chuan Xia et Al☆19Aug 24, 2014Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆23Sep 18, 2020Updated 5 years ago
- Chinese processing☆36Jan 29, 2014Updated 12 years ago
- 新词发现☆66May 30, 2014Updated 12 years ago
- 微博情感分析☆12Sep 1, 2013Updated 12 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆25Sep 5, 2018Updated 7 years ago
- 基于TextRank和WordNet的中英文单文档自动摘要☆63Dec 18, 2015Updated 10 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆20May 9, 2020Updated 6 years ago
- 依存关系分析,NLP,自然语言处理☆85Oct 15, 2021Updated 4 years ago
- SWIG Wrapper for the SRILM toolkit☆35Oct 5, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Chinese new word discovery☆43Aug 30, 2024Updated last year
- minitools☆104Jul 25, 2013Updated 12 years ago
- a demo site for jieba☆111Jul 29, 2013Updated 12 years ago
- 基于深度学习的中文分词尝试☆85Aug 27, 2015Updated 10 years ago
- hmm是实现命名实体识别,python 实现,对2014的人民日报语料进行按字切分,统计初始、转换、发射概率☆16Oct 19, 2017Updated 8 years ago
- solution for the 5th place of cikm cup 2014☆19Jan 28, 2015Updated 11 years ago
- ☆12Mar 1, 2019Updated 7 years ago
- Code for EMNLP2017 paper "A Soft-label Method for Noise-tolerant Distantly Supervised Relation Extraction"☆43Apr 17, 2018Updated 8 years ago
- 多种句子相似度算法☆36May 22, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 无字典中文关键字提取法☆11Nov 29, 2019Updated 6 years ago
- 微型中文关键词抽取服务☆56Nov 18, 2017Updated 8 years ago
- RNN(LSTM, GRU) in Theano with mini-batch training; character-level language models in Theano☆69Oct 14, 2018Updated 7 years ago
- movie ontology knowledge graph entity linking☆18Jan 19, 2016Updated 10 years ago
- "结巴"中文分词的C++版本,使用 darts Double Array Trie 降低内存占用到 1/100☆53Aug 26, 2022Updated 3 years ago
- A Simpler GloVe model for distributed word representation☆87Aug 18, 2021Updated 4 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Jan 26, 2014Updated 12 years ago