Visual comparison of word vectors trained with word2vec and word vectors generated by BERT
☆15 · Jun 29, 2020 · Updated 5 years ago
Alternatives and similar repositories for Word2vec-BERT-
Users that are interested in Word2vec-BERT- are comparing it to the libraries listed below
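- Generates document feature vectors with the open-source Bert-as-Service pretrained model, clusters COVID-19 literature with k-means, visualizes the data with t-SNE, generates topic keywords for each cluster with LDA, and draws a Bokeh plot that supports searching and filtering the data by cluster and keyword. ☆19 · Aug 3, 2020 · Updated 5 years ago
- Generates sentence vectors or word vectors with a pretrained BERT model ☆26 · Oct 29, 2020 · Updated 5 years ago
- A data annotation tool for sequence labeling tasks (word segmentation, NER) ☆11 · Jun 3, 2020 · Updated 5 years ago
- Article cosine similarity computation implemented in Python3 ☆10 · Sep 28, 2017 · Updated 8 years ago
- The dialogue set extractor is a tool based on the chatglm model for extracting dialogue sets from text. It helps users automatically extract dialogue from novels, scripts, and similar texts for analysis, annotation, or other applications. ☆12 · Nov 22, 2024 · Updated last year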
- Analyzing patent network data by downloading patentsview.org into MongoDB ☆14 · Jun 21, 2022 · Updated 3 years ago
- ☆10 · Sep 27, 2021 · Updated 4 years ago
- SVM model training, bag-of-words training, Weibo sentiment analysis, text classification ☆11 · Jan 28, 2018 · Updated 8 years ago
- KGML for EMNLP 2021 ☆10 · Feb 2, 2022 · Updated 4 years ago
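- Learning deep learning from scratch, following Andrew Ng's Coursera courses and implementing the course assignments in Python from memory after each lesson ☆12 · Jan 14, 2020 · Updated 6 years ago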
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso… ☆12 · Dec 14, 2024 · Updated last year
- Latent Dirichlet Allocation and Dynamic Topic Modeling ☆10 · Aug 11, 2021 · Updated 4 years ago
- CatIss is an intelligent tool for automatic categorization of issue reports based on the RoBERTa model. ☆11 · Mar 8, 2022 · Updated 4 years ago
- My solution for Quora's Question Pair contest on Kaggle. ☆10 · Jul 11, 2017 · Updated 8 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization. ☆11 · May 7, 2020 · Updated 5 years ago
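- A high-quality, categorized dataset of 77,370 sensitive texts and 22,823 sensitive words ☆16 · Mar 18, 2025 · Updated 11 months ago
- CCF Big Data Competition: content-based identification of spam SMS messages ☆11 · Mar 13, 2016 · Updated 9 years ago
- Obtaining Chinese character and word vectors with Bert ☆10 · Jan 18, 2022 · Updated 4 years ago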
- Automatic missing value imputation using random forests ☆14 · Aug 19, 2015 · Updated 10 years ago
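- Functionality: given a newly entered text, compares its similarity against existing data and returns the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago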
- Dynamic Topic Modelling Tutorial Files ☆13 · May 12, 2015 · Updated 10 years ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16… ☆15 · Nov 13, 2020 · Updated 5 years ago
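- This course series was planned by Professor Xiao Yanghua of Fudan University, with lectures by young scholars from Fudan University, Huawei Cloud, Hunan University, East China Normal University, Shanghai University of Finance and Economics, Donghua University, Soochow University, and other institutions. The course has toured China several times and was well received by attendees. The knowledge graph course comprehensively and systematically teaches and discusses concepts and technical topics related to knowledge graphs, and addresses a series of difficulties in current industry deployment… ☆10 · Apr 24, 2020 · Updated 5 years ago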
- ☆14 · Sep 17, 2020 · Updated 5 years ago
- Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings. ☆13 · Dec 14, 2021 · Updated 4 years ago
- A basic framework built on the PySpark library that uses SparkSql to connect to a MYSQL database and run statistical analysis on the data ☆14 · Apr 24, 2018 · Updated 7 years ago
- Demo for the calculation of the Semantic Brand Score (Basic Version) ☆13 · Sep 1, 2020 · Updated 5 years ago
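- A simple demo of using SpringBoot with ElasticSearch ☆15 · Jun 21, 2022 · Updated 3 years ago
- A product recommendation engine based on collaborative filtering ☆13 · Oct 28, 2020 · Updated 5 years ago
- Chinese named entity recognition with cnn bilstm crf ☆13 · Sep 25, 2020 · Updated 5 years ago
- A simple retrieval-based question answering system built on ES, mainly for learning ES operations in Python ☆13 · Dec 2, 2019 · Updated 6 years ago
- A simple integration of springBoot with neo4j ☆12 · Jan 16, 2019 · Updated 7 years ago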
- Community detection in patent co-citation network ☆14 · Feb 4, 2019 · Updated 7 years ago
- Outsourced enterprise project: 顺风车管家 (a ride-sharing manager) ☆17 · Aug 6, 2019 · Updated 6 years ago
- ☆17 · Jun 1, 2022 · Updated 3 years ago
- Small tutorial on how you can use BERT for Topic Modeling ☆18 · Jun 1, 2021 · Updated 4 years ago
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code ☆15 · Jun 12, 2023 · Updated 2 years ago
- Code for CascadeBERT, Findings of EMNLP 2021 ☆12 · Mar 30, 2022 · Updated 3 years ago
- Unifew: Unified Fewshot Learning Model ☆18 · Sep 10, 2021 · Updated 4 years ago