yangshoujian / Literature-Clustering-Bert
使用开源的Bert-as-Service预训练生成文档特征向量,基于k-means对COVID-19文献聚类,t-SNE可视化数据,通过LDA为每个簇生成主题关键词,画Bokeh图实现按簇、关键词搜索和筛选数据。
☆19Updated 4 years ago
Alternatives and similar repositories for Literature-Clustering-Bert:
Users that are interested in Literature-Clustering-Bert are comparing it to the libraries listed below
- BTM实现代码☆96Updated 2 years ago
- 利用Bert获取中文字、词向量☆10Updated 3 years ago
- 利用bert预训练模型生成句向量或词向量☆28Updated 4 years ago
- Clustering text with Bert☆56Updated 4 years ago
- 复现了论文《基于主题模型的短文本关键词抽取及扩展》的代码☆30Updated 4 years ago
- BTM in python☆72Updated 7 years ago
- 事件知识图谱构建相关的论文, 包含事件抽取、事件关系识别等任务☆82Updated last year
- Aspect Based Sentiment Analysis 基于方面的细粒度情感分析☆168Updated 2 years ago
- 提出基于划分的LDA主题模型 (PLDA)。对传 统LDA模型进行改进,考虑中长篇文档篇章结构较为清晰,传统LDA在处理中长篇文档时不能识别每个篇章的主题,提出基于划分的LDA主题模型,对中长篇文档如新闻报道】国务院工作报告等按照段落进行划分,先拆后合,并将其效果与传统LDA…☆38Updated 5 years ago
- multi-label,classifier,text classification,多标签文本分类,文本分类,BERT,ALBERT,multi-label-classification,seq2seq,attention,beam search☆32Updated 2 years ago
- Papers, models and datasets for Aspect-Category Sentiment Analysis.☆66Updated 2 years ago
- 一个简单的中文事件抽取模型,触发词和实体联合标注识别,同时判定实体角色。☆72Updated 4 years ago
- 训练词向量☆21Updated 4 years ago
- SMP 2020年微博情感分类评测任务 第六名解决方案☆68Updated 2 years ago
- TF-IDF+Word2vec做文本相似度计算,最好是长文本☆24Updated 5 years ago
- 中文文本预处理,Word2Vec训练计算文本相似度。☆45Updated 5 years ago
- pytorch implementation of multi-label text classification, includes kinds of models and pretrained. Especially for Chinese preprocessing.☆75Updated 5 years ago
- 多标签文本分类☆53Updated 5 years ago
- 根据褒贬种子词,利用SO-PMI构建情感词典☆25Updated 9 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated last year
- 基于CNN、RNN、GCN、BERT的中文文本分类☆43Updated last year
- 用gensim和TensorFlow训练word2vec中文词向量☆11Updated 5 years ago
- ☆34Updated 4 years ago
- WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit…☆124Updated 6 years ago
- 指针生成网络的文本摘要☆18Updated 4 years ago
- Bert预训练模型fine-tune计算文本相似度☆100Updated last year
- COVID-19-sentiment-analysis-dataset-Weibo☆37Updated 4 years ago
- 篇章级事件抽取☆20Updated 4 years ago
- 这段代码会根据用户定义的正则表达式规则从文本中抽取属性三元组,我用它完成知识图谱构建的一个环节,使用方法写在README中,欢迎交流和指正!☆11Updated 3 years ago
- 事件抽取-目前限定火灾新闻☆20Updated 5 years ago