xiaorancs / text-similarity
Computes similarity using different methods
☆42Updated 6 years ago
Alternatives and similar repositories for text-similarity:
Users that are interested in text-similarity are comparing it to the libraries listed below
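- A simple trie implemented in Python that supports adding, looking up, and deleting keywords; used for keyword matching, stop-word removal, and similar tasks on Chinese text.☆64Updated 5 years ago
- Entity Linking: identifies the named entities that appear in a given text and maps them to unique entities in a specific knowledge base. Covers named entity recognition, disambiguation, and related work.☆71Updated 5 years ago
- New word discovery algorithm (NewWordDetection)☆92Updated 4 years ago
- Annotations for the Chinese word segmenter jieba (Python version)☆92Updated 6 years ago
- New word discovery algorithm (NewWordDetection)☆62Updated 7 years ago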
- Chinese Natural Language Processing tools and examples☆162Updated 9 years ago
- New word discovery☆66Updated 10 years ago
- Sentence Distance☆55Updated 6 years ago
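- Automatic single-document summarization for Chinese and English, based on TextRank and WordNet☆63Updated 9 years ago
- Semantic similarity algorithm for Chinese words based on HowNet (《知网》)☆41Updated 11 years ago
- FastText Chinese documentation☆61Updated 4 years ago
- A deep-learning-based natural language processing library☆156Updated 6 years ago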
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆48Updated 8 years ago
- A read-through of the word2vec source code, annotated with Chinese comments☆61Updated 8 years ago
- Source code and corpora for the paper "Iterated Dilated Convolutions for Chinese Word Segmentation"☆135Updated 4 years ago
- Details of the cw2vec paper☆82Updated 6 years ago
- Uses the simhash algorithm to quickly index and query large numbers of text résumés☆22Updated 9 years ago
- Code lab for NLP, including doc2txt, TF-IDF, CNN, text classification, HMM CWS, and CRF NER.☆42Updated 7 years ago
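- Computes the similarity of Chinese texts using TF feature vectors and simhash fingerprints☆216Updated 8 years ago
- LDA modeling of WeChat Moments to discover the most frequently discussed topics☆13Updated 9 years ago
- Automatic error correction for Chinese text☆85Updated 6 years ago
- A self-implemented NLP toolkit: personal implementations of Chinese natural language processing components, providing HMM- and CRF-based interfaces for word segmentation, part-of-speech tagging, and named entity recognition, plus a CRF-based dependency parsing interface.☆55Updated 7 years ago
- Chinese corpus of environmental emergency events (Chinese Environment Emergency Corpus), Semantic Intelligence Laboratory, Shanghai University☆46Updated 9 years ago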
- Self-implemented SpellCorrection for query correction, based on pinyin similarity and edit distance.☆82Updated 2 years ago
- A deep-learning-based natural language processing library☆37Updated 7 years ago
- ☆58Updated 3 years ago
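- Automatic extraction of relation triples using dependency parsing☆99Updated 3 years ago
- Crawls summary-title pairs of news articles from portal sites and uses seq2seq to generate titles from summaries☆45Updated 7 years ago
- Chinese word segmentation using deep learning☆61Updated 7 years ago
- New word discovery based on word frequency, cohesion coefficient, and left/right adjacency entropy☆122Updated 5 years ago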