Visual comparison of word vectors trained with word2vec and word vectors generated by BERT
☆15Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for Word2vec-BERT-
Users that are interested in Word2vec-BERT- are comparing it to the libraries listed below.
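- Uses the open-source Bert-as-Service to generate document feature vectors from a pretrained model, clusters COVID-19 literature with k-means, visualizes the data with t-SNE, generates topic keywords for each cluster with LDA, and draws a Bokeh plot that supports searching and filtering the data by cluster and keyword.☆19Aug 3, 2020Updated 5 years ago
- Generates sentence or word vectors with a pretrained BERT model☆26Oct 29, 2020Updated 5 years ago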
- ☆13Jan 14, 2021Updated 5 years ago
- A data annotation tool (word segmentation, NER) for sequence labeling tasks☆11Jun 3, 2020Updated 5 years ago
- Implementation of "Optimizing neural networks for patent classification" paper☆14Jun 24, 2019Updated 6 years ago
- Community detection in patent co-citation network☆14Feb 4, 2019Updated 7 years ago
- ☆13Jul 12, 2022Updated 3 years ago
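- Demos for several tasks in the NLP field☆13Sep 11, 2023Updated 2 years ago
- PMI is a special case of mutual information (NMI), a concept from information theory that mainly measures how strongly two signals are associated. In text processing, PMI is used to compute the degree of association between two words. Compared with traditional similarity measures, the advantage of PMI is that it uses word co-occurrence statistics to analyze whether words are semantically related…☆15Aug 24, 2020Updated 5 years ago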
- Self complemented Word Collocation using MI method which is tested to be effective. Word collocation extraction based on the mutual information algorithm☆28Apr 5, 2018Updated 7 years ago
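- The dialogue set extractor is a tool based on the chatglm model for extracting dialogue sets from text. It helps users automatically extract dialogue from novels, scripts, and other texts for analysis, annotation, or other applications.☆12Nov 22, 2024Updated last year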
- MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code☆15Jun 12, 2023Updated 2 years ago
- Sentiment classification of Chinese Weibo comments☆26May 11, 2019Updated 6 years ago
- Automatic missing value imputation using random forests☆14Aug 19, 2015Updated 10 years ago
- Chinese named entity recognition with CNN, BiLSTM, and CRF☆13Sep 25, 2020Updated 5 years ago
- Dynamic Topic Modelling Tutorial Files☆13May 12, 2015Updated 10 years ago
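- Function: given a newly input text, compare its similarity against the existing data and return the top 10 most similar texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity.☆11Jul 30, 2020Updated 5 years ago
- Obtains Chinese character and word vectors with BERT☆10Jan 18, 2022Updated 4 years ago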
- Supplementary code for "News Frame Analysis: An Inductive Mixed-method Computational Approach" http://dx.doi.org/10.1080/19312458.2019.16…☆15Nov 13, 2020Updated 5 years ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso…☆13Dec 14, 2024Updated last year
- Fine-tuning of the Qwen1.5 large model with LoRA via the PEFT framework, for text classification on the HC3-Chinese dataset.☆12Jun 29, 2024Updated last year
- ☆17Jun 1, 2022Updated 3 years ago
- Demo for the calculation of the Semantic Brand Score (Basic Version)☆13Sep 1, 2020Updated 5 years ago
- The topic sentence extraction method based on headline classification can be described as follows: given a news report, compute the similarity between the headline and the set of news topic words to decide whether the headline is indicative. For an indicative headline, extract the sentence in the report most similar to it as the topic sentence; otherwise, combine multiple features to score the importance of each sentence in the report and take the highest-scoring sentence as the topic sentence.☆40Jul 26, 2016Updated 9 years ago
- Fruit classification☆17Sep 11, 2023Updated 2 years ago
- Patent information and full-text download☆25Dec 27, 2022Updated 3 years ago
- PyTorch implementation of Deep & Cross Network for Ad Click Predictions from Google☆18Sep 3, 2018Updated 7 years ago
- [python] A simple conversational bot built with the Baidu speech recognition API, the Baidu speech synthesis API, and the Turing Robot API☆10Mar 13, 2021Updated 5 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Jan 5, 2022Updated 4 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- ChineseDiachronicCorpus, a Chinese diachronic corpus spanning more than sixty years, including Tencent diachronic news 2000-2016, the People's Daily diachronic corpus 1946-2003, and the Reference News (参考消息) diachronic corpus 1957-2002. Built on diachronic circulating corpora, it provides foundational corpus support for computing diachronic language change, language monitoring, and research on sociocultural change…☆23Jan 10, 2021Updated 5 years ago
- Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"☆27May 18, 2024Updated last year
- WordBias: Visualizing Intersectional Social biases encoded in Word Embeddings☆23Aug 18, 2025Updated 7 months ago
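- Using sentiment analysis in NLP as an example, walks step by step through labeling an unlabeled corpus, then builds models with a traditional machine learning model (SVM) and a deep learning model (LSTM).☆36Jul 3, 2019Updated 6 years ago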
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- Weibo crawler based on keyword search results☆31Nov 6, 2018Updated 7 years ago
- ☆24Dec 20, 2022Updated 3 years ago
- ☆11May 12, 2023Updated 2 years ago