nvliajia / Word2vec-BERT-View external linksLinks
将word2vec训练生成的词向量和BERT生成的词向量进行可视化对比
☆15Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for Word2vec-BERT-
Users that are interested in Word2vec-BERT- are comparing it to the libraries listed below
Sorting:
- 使用开源的Bert-as-Service预训练生成文档特征向量,基于k-means对COVID-19文献聚类,t-SNE可视化数据,通过LDA为每个簇生成主题关键词,画Bokeh图实现按簇、关键词搜索和筛选数据。☆19Aug 3, 2020Updated 5 years ago
- 利用bert预训练模型生成句向量或词向量☆27Oct 29, 2020Updated 5 years ago
- Python3 实现的文章余弦相似度计算☆10Sep 28, 2017Updated 8 years ago
- 在NLP领域中一些任务的Demo☆13Sep 11, 2023Updated 2 years ago
- 【python】利用百度语音识别API,百度语音合成API,图灵机器人API实现简单的对话机器人☆10Mar 13, 2021Updated 4 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- 对话集提取器是一个基于chatglm模型的工具,用于从文本中提取对话集。该工具可以帮助用户从小说、剧本等文本中自动提取出对话,以便进行分析、标注或其他应用。☆12Nov 22, 2024Updated last year
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆12Dec 14, 2024Updated last year
- 从0学习深度学习课程,跟随Andrew Ng的Coursera课程,课后根据记忆用python代码实现课程作业☆12Jan 14, 2020Updated 6 years ago
- ☆13Jul 12, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11May 18, 2021Updated 4 years ago
- Latent Drichlet Allocation and Dynamic Topic Modeling☆10Aug 11, 2021Updated 4 years ago
- Dynamic Topic Modelling Tutorial Files☆13May 12, 2015Updated 10 years ago
- 77,370条敏感文本和22,823个敏感词的高质量数据集,并进行分类☆14Mar 18, 2025Updated 10 months ago
- CCF大数据竞赛--垃圾短信基于文本内容的识别☆11Mar 13, 2016Updated 9 years ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16…☆15Nov 13, 2020Updated 5 years ago
- 实现功能:新输入一段文本,与已有数据进行相似度进行比较,返回TOP10的文本。主要实现方法:jieba中文分词、gensim、TF-IDF词汇重要性、cosine余弦相似度。☆11Jul 30, 2020Updated 5 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- Automatic missing value imputation using random forests☆14Aug 19, 2015Updated 10 years ago
- Implementation of "Optimizing neural networks for patent classification" paper☆14Jun 24, 2019Updated 6 years ago
- 利用Bert获取中文字、词向量☆10Jan 18, 2022Updated 4 years ago
- 基于PySpark库,使用SparkSql连接MYSQL数据库并对数据进行统计分析的基础架构☆14Apr 24, 2018Updated 7 years ago
- 水果分类☆17Sep 11, 2023Updated 2 years ago
- 基于ES构建的一个简单的检索式问答系统,主要用来学习下python相关的ES操作☆13Dec 2, 2019Updated 6 years ago
- ☆15Aug 23, 2023Updated 2 years ago
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- Demo for the calculation of the Semantic Brand Score (Basic Version)☆13Sep 1, 2020Updated 5 years ago
- 基于协同过滤算法的商品推荐引擎☆13Oct 28, 2020Updated 5 years ago
- ☆14Sep 17, 2020Updated 5 years ago
- NLP方向的论文代码复现☆14Jul 15, 2020Updated 5 years ago
- springBoot的简单整合neo4j☆13Jan 16, 2019Updated 7 years ago
- SpringBoot 结合 ElasticSearch 使用的一个简单Demo☆15Jun 21, 2022Updated 3 years ago
- tensorflow2.0 实现的 DCN (Deep & Cross Network) ,使用 Criteo 子数据集加以实践。☆15Aug 1, 2020Updated 5 years ago
- 从白话文生成古诗。我的NLP期末课程项目☆14Nov 10, 2020Updated 5 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 3 years ago
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- 外包企业项目-顺风车管家☆17Aug 6, 2019Updated 6 years ago
- ☆17Oct 19, 2021Updated 4 years ago
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago