PMI, 是互信息(NMI)中的一种特例, 而互信息,是源于信息论中的一个概念,主要用于衡量2个信号的关联程度.至于PMI,是在文本处理中,用于计算两个词语之间的关联程度.比起传统的相似度计算, pmi的好处在于,从统计的角度发现词语共现的情况来分析出词语间是否存在语义相关 , 或者主题相关的情况.
☆15Aug 24, 2020Updated 5 years ago
Alternatives and similar repositories for PMI
Users that are interested in PMI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 为提高推理速度优化代码,并在中文语料上复现RE2模型☆15Mar 24, 2023Updated 3 years ago
- 基于NER的文本纠错☆15Dec 27, 2023Updated 2 years ago
- ☆25Mar 6, 2016Updated 10 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText☆10Sep 3, 2019Updated 6 years ago
- 使用开源的Bert-as-Service预训练生成文档特征向量,基于k-means对COVID-19文献聚类,t-SNE可视化数据,通过LDA为每个簇生成主题关键词,画Bokeh图实现按簇、关键词搜索和筛选数据。☆19Aug 3, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A simple implementation of Transformer Encoder in keras. This repository also includes an example of Transformer as a classifier and its …☆16Apr 9, 2019Updated 7 years ago
- 将word2vec训练生成的词向量和BERT生成的词向量进行可视化对比☆15Jun 29, 2020Updated 5 years ago
- Self complemented Word Collocation using MI method which is tested to be effective..基于互信息算法的词语搭配抽取☆28Apr 5, 2018Updated 8 years ago
- 使用gensim训练word2vec模型并对训练得到词向量聚类☆16Sep 23, 2017Updated 8 years ago
- ☆19Feb 9, 2021Updated 5 years ago
- Summary of Text Classification in deep learning techniques implemented by PyTorch and TensorFlow. 深度学习文本分类技术总结,以PyTorch实现。☆14Dec 18, 2019Updated 6 years ago
- RUL Prognostics Method Based on Real Time Updating of LSTM Parameters☆24Apr 19, 2018Updated 8 years ago
- ☆22Jul 2, 2021Updated 4 years ago
- 问题等价性判断数据预处理,包含添加对抗样本(同音字、近义词替换等)、获取样本的pattern(用通配符替换相同词汇,提取相同和不同词汇)☆39Dec 23, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Hermes is a library built on top of TensorFlow 2 designed to provide simple, abstractions for natural language processing utilizing end t…☆18Jun 5, 2021Updated 5 years ago
- Using BERT+CRF model to do Chinese NER task☆27Sep 30, 2020Updated 5 years ago
- My implementation of 《Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory》☆34Oct 15, 2019Updated 6 years ago
- 个性化推荐模型,主要包括als、als_wr、biaslfm、lfm、nmf、svdpp、基于内容、基于内容回归、user-cf、item-cf、slopeone、关联规则以及基于内容和cf的混合等模型。☆33Sep 11, 2022Updated 3 years ago
- 细粒度的情感分析(属性词提取,句法依存分析)☆36Feb 23, 2023Updated 3 years ago
- 新词发现算法与同义词挖掘☆27Oct 24, 2017Updated 8 years ago
- 利用哈工大同义词林替换问答文本内的同义词进行语料扩充☆37Jun 6, 2019Updated 7 years ago
- 电商评论观点挖掘☆44Jan 29, 2021Updated 5 years ago
- 简易的中文纠错和消歧☆289Aug 19, 2015Updated 10 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆22Sep 3, 2018Updated 7 years ago
- 推荐算法学习☆39Mar 24, 2023Updated 3 years ago
- JDDC基线模型Seq2Seq☆43May 8, 2018Updated 8 years ago
- Unsupervised Word Discovery☆10Jul 26, 2019Updated 6 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 6 years ago
- Code and dataset for our paper "Replicate, Walk, and Stop on Syntax: an Effective Neural Network Model for Aspect-Level Sentiment Classif…☆31Jul 25, 2024Updated last year
- Code for EMNLP 2019 paper "A Boundary-aware Neural Model for Nested Named Entity Recognition"☆89Jan 24, 2022Updated 4 years ago
- Codebase for Efficient yet simple Reinforcement Learning Research Framework☆28Jan 14, 2023Updated 3 years ago
- asyncio executors, clean and simple.☆10Nov 25, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI☆12Mar 31, 2022Updated 4 years ago
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- Server wrapper for ml models☆11Sep 11, 2019Updated 6 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago