将word2vec训练生成的词向量和BERT生成的词向量进行可视化对比
☆15Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for Word2vec-BERT-
Users that are interested in Word2vec-BERT- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 使用开源的Bert-as-Service预训练生成文档特征向量,基于k-means对COVID-19文献聚类,t-SNE可视化数据,通过LDA为每个簇生成主题关键词,画Bokeh图实现按簇、关键词搜索和筛选数据。☆19Aug 3, 2020Updated 5 years ago
- svm 模型训练 词袋的训练 微博情感分析 文本分类☆11Jan 28, 2018Updated 8 years ago
- 利用bert预训练模型生成句向量或词向量☆26Oct 29, 2020Updated 5 years ago
- ☆11Apr 10, 2024Updated 2 years ago
- Python3 实现的文章余弦相似度计算☆10Sep 28, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 12, 2022Updated 3 years ago
- 基于知识图谱的政务问答系统☆22May 30, 2021Updated 5 years ago
- NLP project, based on Bert/UNILM☆23Dec 29, 2019Updated 6 years ago
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- Automatic missing value imputation using random forests☆14Aug 19, 2015Updated 10 years ago
- cnn bilstm crf 作中文命名实体识别☆13Sep 25, 2020Updated 5 years ago
- 政务公文知识图谱构建☆22Oct 12, 2022Updated 3 years ago
- tensorflow2.0 实现的 DCN (Deep & Cross Network) ,使用 Criteo 子数据集加以实践。☆15Aug 1, 2020Updated 5 years ago
- 实现功能:新输入一段文本,与已有数据进行相似度进行比较,返回TOP10的文本。主要实现方法:jieba中文分词、gensim、TF-IDF词汇重要性、cosine余弦相似度。☆11Jul 30, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 利用Bert获取中文字、词向量☆10Jan 18, 2022Updated 4 years ago
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16…☆15Nov 13, 2020Updated 5 years ago
- 77,370条敏感文本和22,823个敏感词的高质量数据集,并进行分类☆18Mar 18, 2025Updated last year
- Latent Drichlet Allocation and Dynamic Topic Modeling☆10Aug 11, 2021Updated 4 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。☆12Jun 29, 2024Updated last year
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated 2 years ago
- 从白话文生成古诗。我的NLP期末课程项目☆14Nov 10, 2020Updated 5 years ago
- ☆32Sep 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Sep 27, 2021Updated 4 years ago
- 基于标题分类的主题句提取方法可描述为: 给定一篇新闻报道, 计算标题与新闻主题词集的相似度, 判断标题是否具有提示性。对于提示性标题,抽取新闻报道中与其最相似的句子作为主题句; 否则, 综合利用多种特征计算新闻报道中句子的重要性, 将得分最高的句子作为主题句。☆39Jul 26, 2016Updated 9 years ago
- Small tutorial on how you can use BERT for Topic Modeling☆18Jun 1, 2021Updated 4 years ago
- Classification of 6 viruses including covid-19 based on their DNA sequences using Transformers☆18Apr 18, 2023Updated 3 years ago
- US utility patent similarity data creation and analysis tools☆28Oct 28, 2020Updated 5 years ago
- 【python】利用百度语音识别API,百度语音合成API,图灵机器人API实现简单的对话机器人☆10Mar 13, 2021Updated 5 years ago
- ☆29May 7, 2025Updated last year
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- Nonlinear Granger causality using machine learning techniques☆22Sep 8, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 以nlp中情感分析为例,从没有打过标签的语料开始一步步教你怎么去打标签,然后分别以传统的机器学习模型SVM和深度学习模型LSTM去建模。☆36Jul 3, 2019Updated 6 years ago
- mRNN is an implementation of a Gated Recurrent Unit (GRU) network for classification of transcripts as either coding or noncoding.☆22Dec 6, 2018Updated 7 years ago
- 基于关键词搜索结果的微博爬虫☆31Nov 6, 2018Updated 7 years ago
- ☆24Dec 20, 2022Updated 3 years ago
- Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an E…☆22Feb 21, 2020Updated 6 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 4 years ago
- ☆21Mar 30, 2022Updated 4 years ago