yanyiwu / simhash
中文 文档simhash值计算
☆1,106Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for simhash
- An Efficient Lexical Analyzer for Chinese☆795Updated last year
- 自动构建中文词库:http://www.matrix67.com/blog/archives/5044☆650Updated 11 months ago
- An Efficient Lexical Analyzer for Chinese☆2,024Updated 2 years ago
- A simple short-text classification tool based on LibLinear☆677Updated 3 years ago
- 中文语义分析、网络舆情、中文分词 资料☆498Updated 3 years ago
- Use C Api and Swig to Speed up jieba 高效的中文分词库☆632Updated 3 years ago
- A Python Implementation of Simhash Algorithm☆982Updated 2 years ago
- 用TF特征向量和simhash指纹计算中文文本的相似度☆212Updated 8 years ago
- Language Technology Platform☆4,969Updated last month
- 微信公众号语料库☆573Updated 5 years ago
- 这个项目是一个基本包.封装了大多数nlp项目中常用工具☆1,496Updated 7 months ago
- 百度NLP:分词,词性标注,命名实体识别,词重要性☆3,886Updated 3 years ago
- 2019-SOTA简繁中文拼写检查工具:FASPell Chinese Spell Checker (Chinese Spell Check / 中文拼写检错 / 中文拼写纠错 / 中文拼写检查)☆1,202Updated 2 years ago
- ☆3,413Updated last month
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆499Updated 4 years ago
- 商用级垃圾文本分类器☆405Updated 2 years ago
- 中文语句中的时间语义识别。即通过分析中文语句,识别出话语中提到的时间。☆629Updated 11 months ago
- pyltp: the python extension for LTP☆1,536Updated 2 years ago
- 百度开源的依存句法分析系统☆976Updated last year
- A Python wrapper around the NLPIR/ICTCLAS Chinese segmentation software.☆569Updated this week
- A Chinese Nature Language Toolkit☆1,679Updated 4 years ago
- 从中文文本中自动提取关键词和摘要☆3,286Updated 7 months ago
- Annotator for Chinese Text Corpus (UNDER DEVELOPMENT) 中文文本标注工具☆1,461Updated 7 months ago
- 中文Neural conversational model in Torch☆418Updated 3 years ago
- Deep Learning Chinese Word Segment☆2,083Updated 6 years ago
- Java porting of Darts (Double ARray Trie System)☆268Updated 6 years ago
- 中文分词☆3,136Updated 7 months ago
- FAQ-based Question Answering System☆2,585Updated 3 years ago
- A Toolkit for Industrial Topic Modeling☆2,637Updated 3 years ago