liuhuanyong / SinglepassTextCluster
SinglepassTextCluster is a small text clustering tool based on the single-pass clustering algorithm. It ships with two text vectorization methods, TF-IDF and doc2vec, automatically outputs the number of clusters, the set of documents in each cluster, and the cluster sizes, and is intended for clustering your own real-time data.
☆62 Updated 3 years ago
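To make the description above concrete, here is a minimal sketch of single-pass clustering over TF-IDF vectors. It is not the repository's code: the function names, the whitespace tokenizer, the batch-computed TF-IDF weights, the running-mean centroid update, and the `threshold` default are all assumptions chosen for illustration. Chinese text would additionally need a word segmenter before this step.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (term -> weight) from whitespace-tokenized docs.

    Assumption: IDF is computed over the whole batch for simplicity.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in term_freq.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def single_pass(docs, threshold=0.3):
    """Assign each document, in arrival order, to the most similar cluster.

    A document joins the best-matching cluster if its similarity to that
    cluster's centroid reaches `threshold` (a hypothetical default);
    otherwise it starts a new cluster.
    """
    clusters = []  # each cluster: {"centroid": dict, "members": [doc indices]}
    for idx, vec in enumerate(tfidf_vectors(docs)):
        best, best_sim = None, 0.0
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim > best_sim:
                best, best_sim = cluster, sim
        if best is not None and best_sim >= threshold:
            # Update the centroid as the running mean of its member vectors.
            size = len(best["members"])
            centroid = best["centroid"]
            for term in set(centroid) | set(vec):
                centroid[term] = (
                    centroid.get(term, 0.0) * size + vec.get(term, 0.0)
                ) / (size + 1)
            best["members"].append(idx)
        else:
            clusters.append({"centroid": dict(vec), "members": [idx]})
    return clusters


if __name__ == "__main__":
    sample = [
        "stock market falls sharply",
        "stock market falls again",
        "football team wins final",
        "football team wins cup",
    ]
    result = single_pass(sample)
    print("number of clusters:", len(result))
    for i, cluster in enumerate(result):
        print(i, "size:", len(cluster["members"]), "docs:", cluster["members"])
```

The number of clusters is not fixed in advance; it falls out of the similarity threshold and the order in which documents arrive, which is why the approach suits streaming data.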
Alternatives and similar repositories for SinglepassTextCluster:
Users who are interested in SinglepassTextCluster are comparing it to the libraries listed below:
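- Using LEAR to do NER extraction ☆29 Updated 3 years ago
- Sentence matching models, including the unsupervised SimCSE, ESimCSE, and PromptBERT, and the supervised SBERT and CoSENT. ☆98 Updated 2 years ago
- ☆88 Updated 3 years ago
- A simple framework for building some basic NLP tasks ☆59 Updated 2 years ago
- ☆57 Updated 2 years ago
- PyTorch-based BERT implementation and downstream-task fine-tuning ☆50 Updated 2 years ago
- CoSENT, STS, SentenceBERT ☆163 Updated last month
- PyTorch implementation of unsupervised SimCSE for Chinese ☆133 Updated 3 years ago
- Knowledge graph question answering based on "Seq2Seq + prefix tree" ☆70 Updated 3 years ago
- Chinese BigBird pretrained model ☆91 Updated 2 years ago
- A simple experiment with Pattern-Exploiting Training on Chinese ☆170 Updated 4 years ago
- An experimental Torch implementation of Su Jianlin's CoSENT ☆32 Updated 3 years ago
- CCKS financial event subject extraction ☆72 Updated 4 years ago
- ☆32 Updated 3 years ago
- NLP experiments: new word discovery + continued pre-training of pretrained models ☆47 Updated last year
- WoBERT_pytorch ☆40 Updated 3 years ago
- ☆39 Updated 3 years ago
- CCKS2020 entity linking task for Chinese short text. Main approach: use a semantic model based on BiLSTM and attention to match query and doc text, then rank the matching scores pairwise to select the best knowledge-base entity. ☆47 Updated 4 years ago
- PyTorch-based named entity recognition with Baidu UIE. ☆57 Updated 2 years ago
- Implementation of SimCSE + ESimCSE on Chinese datasets ☆191 Updated 2 years ago
- NLP sentence encoding, sentence embeddings, and semantic similarity: BERT_avg, BERT_whitening, SBERT, SimCSE ☆176 Updated 3 years ago
- Keyword extraction project ☆24 Updated 4 years ago
- An optimized version of GlobalPointer for NER ☆116 Updated 3 years ago
- Chinese coreference resolution based on SpanBERT, PyTorch implementation ☆97 Updated 2 years ago
- Tianchi COVID-19 similar sentence pair judgment competition, Rank 8 ☆52 Updated 4 years ago
- Continued pre-training of Chinese BERT ☆30 Updated 3 years ago
- Efficient GlobalPointer in PyTorch ☆53 Updated 2 years ago
- 4th-place solution for the 5th Daguan Cup on DataFountain ☆50 Updated 2 years ago
- Prompt-based Chinese text classification. ☆54 Updated last year
- BERT classification, semantic similarity, and sentence vector extraction. ☆65 Updated this week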