guokr / Caver
Caver: a toolkit for multilabel text classification.
☆39Updated 5 years ago
Alternatives and similar repositories for Caver:
Users who are interested in Caver are comparing it to the libraries listed below:
- Python API for Various DB-Backed Simhash Clusters☆64Updated 7 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 6 years ago
- SegPhrase working on Chinese and Arabic☆32Updated 8 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 5 years ago
- Creating a dataset for person name disambiguation using a combination of sources such as Wikipedia, DBLP authors, and PPDB.☆52Updated 7 years ago
- Pure python NLP toolkit☆55Updated 9 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆78Updated 11 years ago
- Tools for Chinese word segmentation and POS tagging written in Python☆38Updated 11 years ago
- Knowledge extraction from web data☆92Updated 6 years ago
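- hanLP split out on its own from the earlier hanLP-python-flask project☆60Updated 7 years ago
- python-segment is a word segmentation library implemented in pure Python; its goal is to provide a usable, complete segmentation system and training environment, including a usable dictionary.☆16Updated 11 years ago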
- autocomplete-redis is a Quora-like autocompletion tool based on Redis.☆204Updated 11 years ago
- Python bloom filter using redis as a shared backend.☆19Updated 7 years ago
- Identify Events from text using Natural Language Processing Modules☆33Updated 8 years ago
- A simple scoring plugin for vector in Elasticsearch.☆69Updated 7 years ago
- Chinese Words Segment Library based on HMM model☆167Updated 10 years ago
- Chinese Word Segmentation Based on Deep Learning and LSTM Neural Networks☆21Updated 8 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆243Updated 12 years ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.☆136Updated 3 years ago
- Train Wikidata with word2vec for word embedding tasks☆122Updated 6 years ago
- Chinese Natural Language Processing tools and examples☆162Updated 8 years ago
- Tobe Algorithm Manual☆49Updated 4 years ago
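- A simple trie structure implemented in Python that supports adding, looking up, and deleting keywords; used for keyword matching, stop-word removal, and similar tasks on Chinese text.☆64Updated 4 years ago
- Chinese Tokenizer; New Words Finder. A three-stage mechanical word segmentation algorithm for Chinese; an algorithm for discovering out-of-vocabulary new words☆95Updated 8 years ago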
- An easy-install script for LibShortText☆27Updated 10 years ago
- Chinese Environment Emergency Corpus, Shanghai University, Semantic Intelligence Laboratory☆43Updated 9 years ago
- Calculate SimRank for a Networkx graph using the Delta SimRank method within a MapReduce framework☆51Updated 8 years ago
- the Chinese NLP full stack toolkit☆41Updated 10 years ago