liuhuanyong / SinglepassTextClusterLinks
SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for individual real-time corpus cluster task。基于single-pass算法思想的自动文本聚类小组件,内置tfidf和doc2vec两种文本向量方法,可自动输出聚类数目、类簇文档集合和簇类大小,用于自有实时数据的聚类任务。
☆63Updated 3 years ago
Alternatives and similar repositories for SinglepassTextCluster
Users that are interested in SinglepassTextCluster are comparing it to the libraries listed below
Sorting:
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆99Updated 2 years ago
- using lear to do ner extraction☆29Updated 3 years ago
- A simple framework for building some basic NLP tasks☆59Updated 2 years ago
- ☆87Updated 3 years ago
- benchmark of KgCLUE, with different models and methods☆27Updated 3 years ago
- 中文无监督SimCSE Pytorch实现☆134Updated 3 years ago
- ☆32Updated 3 years ago
- CCKS2020 面向中文短文本的实体链指任务。主要思路为:使用基于BiLSTM和Attention的语义模型进行Query和Doc的文本匹配,再针对匹配度进行pairwise排序,从而选出最优的知识库实体。☆47Updated 4 years ago
- ☆57Updated 2 years ago
- CoSENT、STS、SentenceBERT☆168Updated 3 months ago
- NLP实验:新词挖掘+预训练模型继续Pre-training☆47Updated last year
- 基于 pytorch 的 bert 实现和下游任务微调☆51Updated 2 years ago
- pytorch Efficient GlobalPointer☆54Updated 3 years ago
- ccks金融事件主体抽取☆72Updated 4 years ago
- TIANCHI-小布助手对话短文本语义匹配BERT baseline☆32Updated 4 years ago
- 中文bigbird预训练模型☆92Updated 2 years ago
- 中国中文信息学会社会媒体处理专业委员会举办的2019届中文人机对话之自然语言理解竞赛☆74Updated 5 years ago
- ☆40Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆176Updated 3 years ago
- 实验苏神的CoSENT的Torch实现☆32Updated 3 years ago
- implementation SlotGated SLU model for keras☆34Updated 4 years ago
- experiments of some semantic matching models and comparison of experimental results.☆161Updated last year
- Pattern-Exploiting Training在中文上的简单实验☆171Updated 4 years ago
- 基于PaddleNLP开源的抽取式UIE进行医学命名实体识别(torch实现)☆43Updated 2 years ago
- ☆91Updated 5 years ago
- ☆127Updated 2 years ago
- Bert分类,语义相似度,获取句向量。☆64Updated 2 months ago
- 中文数据集下SimCSE+ESimCSE的实现☆192Updated 3 years ago
- GAIIC2022商品标题实体识别Baseline,使用GlobalPointer实现,线上0.80349☆53Updated 3 years ago
- P-tuning方法在中文上的简单实验☆139Updated 4 years ago