guokr / CaverLinks
Caver: a toolkit for multilabel text classification.
☆39Updated 6 years ago
Alternatives and similar repositories for Caver
Users that are interested in Caver are comparing it to the libraries listed below
Sorting:
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- Chinese Words Segment Library based on HMM model☆166Updated 11 years ago
- Pure python NLP toolkit☆55Updated 9 years ago
- A simple website demonstrating TextRank's extractive summarization capability.☆55Updated 4 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 6 years ago
- 把之前 hanLP-python-flask 裡面的 hanLP 單獨分出來☆59Updated 8 years ago
- a chinese segment base on crf☆234Updated 7 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated 7 months ago
- Easily generate document/paragraph/sentence vectors and calculate similarity.☆137Updated 4 years ago
- tools for chinese word segmentation and pos tagging written in python☆38Updated 12 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆46Updated 10 years ago
- Neutral Network based Chinese Segment System☆19Updated 9 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 12 years ago
- a text analyzing (match, rewrite, extract) engine (python edition)☆80Updated 8 years ago
- Python API for Various DB-Backed Simhash Clusters☆64Updated 8 years ago
- ☆37Updated 7 years ago
- Paragraph Vector Implementation☆56Updated 8 years ago
- A cool (but quite useless) entity graph generator using Webhose.io☆29Updated 9 years ago
- Distributed text analysis suite based on Celery☆96Updated 3 years ago
- A Python package for pullword.com☆86Updated 5 years ago
- 2016CCF-sougou-code&PPT☆56Updated 9 years ago
- Python bloom filter using redis as a shared backend.☆19Updated 8 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Updated 8 years ago
- experimenting with elasticsearch features for vector fields☆20Updated 3 years ago
- SegPhrase working on Chinese and Arabic☆36Updated 9 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆95Updated 9 years ago
- Detect duplicated items。内容排重框架。☆11Updated 10 years ago
- 中国高校更名记录合并☆13Updated 10 years ago
- A simple scoring plugin for vector in Elasticsearch.☆69Updated 8 years ago
- Chinese Natural Language Processing tools and examples☆162Updated 9 years ago