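PMI (pointwise mutual information) is the pointwise special case of mutual information (MI). Mutual information is a concept from information theory that measures how strongly two signals are associated. In text processing, PMI is used to measure the degree of association between two words. Compared with traditional similarity measures, the advantage of PMI is that it works from co-occurrence statistics to determine whether two words are semantically or topically related.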
☆15, Aug 24, 2020, updated 5 years ago
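As a rough illustration of the idea described above, PMI for a word pair is commonly defined as PMI(x, y) = log( p(x, y) / (p(x) p(y)) ): it is positive when the two words co-occur more often than chance would predict and negative when they co-occur less often. The sketch below is not this repository's implementation. It assumes document-level co-occurrence (two words co-occur if they appear in the same tokenized document), a base-2 logarithm, and a hypothetical helper name `pmi_table`.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(documents):
    """Return {(x, y): PMI} for every word pair that co-occurs in some document.

    Assumption: probabilities are estimated as document frequencies, i.e.
    p(x) = (number of documents containing x) / (number of documents).
    """
    n_docs = len(documents)
    word_counts = Counter()
    pair_counts = Counter()
    for tokens in documents:
        vocab = sorted(set(tokens))  # count each word once per document
        word_counts.update(vocab)
        # sorted vocab guarantees each pair is stored as (smaller, larger)
        pair_counts.update(combinations(vocab, 2))

    scores = {}
    for (x, y), n_xy in pair_counts.items():
        p_xy = n_xy / n_docs
        p_x = word_counts[x] / n_docs
        p_y = word_counts[y] / n_docs
        scores[(x, y)] = math.log2(p_xy / (p_x * p_y))
    return scores


# Toy corpus, already tokenized (illustrative data only).
docs = [
    ["apple", "fruit", "sweet"],
    ["apple", "fruit"],
    ["car", "engine"],
    ["car", "engine", "fast"],
    ["apple", "car"],
]
scores = pmi_table(docs)
print(scores[("apple", "fruit")])  # positive: co-occur more often than chance
print(scores[("apple", "car")])    # negative: co-occur less often than chance
```

Pairs that never co-occur are simply absent from the result, since their PMI would be negative infinity. For Chinese text, the documents would first need word segmentation (for example with a tokenizer such as jieba) before being passed in.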
Alternatives and similar repositories for PMI
Users that are interested in PMI are comparing it to the libraries listed below.
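- Code optimized for faster inference, with a reproduction of the RE2 model on a Chinese corpus (☆15, Mar 24, 2023, updated 3 years ago)
- NER-based text error correction (☆15, Dec 27, 2023, updated 2 years ago)
- Chinese text error correction based on BERT (☆241, Jun 12, 2023, updated 2 years ago)
- (☆25, Mar 6, 2016, updated 10 years ago)
- Chinese typo correction tool. Corrects phonetically similar and visually similar wrong characters (or variant characters); can be used to correct errors from Chinese pinyin and stroke input methods. Developed in Python. (☆10, Mar 5, 2018, updated 8 years ago)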
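- Modify Chinese text, modified on LaserTagger Model. Text paraphrasing: Chinese text data augmentation based on LaserTagger. (☆322, Jan 3, 2024, updated 2 years ago)
- Code for the ACL'18 paper: A Neural Approach to Pun Generation (☆18, Jan 13, 2020, updated 6 years ago)
- A small NLP analysis tool that vectorizes text data using open-source NLP algorithms and model libraries (jieba, spacy, paddlenlp), then applies machine learning algorithms (clustering, principal component analysis, GraphicalLasso graph networks) to analyze associations among the word vectors of the text data. (☆18, Nov 20, 2023, updated 2 years ago)
- IMDB sentiment classification with GPT2 pre training (☆14, May 7, 2020, updated 5 years ago)
- Discover new words from text by computing branch entropy and mutual information. (☆10, Mar 22, 2020, updated 6 years ago)
- A python script to calculate normalized google distance (NGD). This is a semantic similarity metric based on Google search results (☆18, Dec 26, 2023, updated 2 years ago)
- Python 数地工厂 NLPSDK: keyword extraction, summary extraction, new word discovery, event triple extraction, data triple extraction, logic triple extraction, entity recognition, phrase chunk recognition, similarity computation, concept abstraction, semantic association, sentiment polarity classification, sentiment pair extraction, entity attribute sentiment extraction, subjectivity computation, web page body parsing, web page table parsing, entity linking, 问题解… (☆16, Jan 26, 2021, updated 5 years ago)
- Code for NLPCC2016 Chinese Word Similarity Task (☆17, Sep 8, 2016, updated 9 years ago)
- Summary of Text Classification in deep learning techniques implemented by PyTorch and TensorFlow. A summary of deep learning text classification techniques, implemented in PyTorch. (☆14, Dec 18, 2019, updated 6 years ago)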
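- Sentential Semantic Similarity measurement library using BERT Embeddings for spatial distance evaluation. (☆27, Dec 14, 2020, updated 5 years ago)
- (☆22, Jul 2, 2021, updated 4 years ago)
- Data preprocessing for question equivalence judgment, including adding adversarial samples (homophone substitution, synonym substitution, etc.) and extracting sample patterns (replacing shared words with wildcards, extracting shared and differing words) (☆39, Dec 23, 2019, updated 6 years ago)
- (☆15, Sep 2, 2017, updated 8 years ago)
- Using BERT+CRF model to do Chinese NER task (☆27, Sep 30, 2020, updated 5 years ago)
- My implementation of 《Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory》 (☆34, Oct 15, 2019, updated 6 years ago)
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀" ("text scalpel"). Currently, this project implements a text paraphrasing task, used for data augmentation of NLP corpora. (☆215, Mar 24, 2023, updated 3 years ago)
- (☆31, Jan 3, 2016, updated 10 years ago)
- Fine-grained sentiment analysis (aspect term extraction, syntactic dependency parsing) (☆36, Feb 23, 2023, updated 3 years ago)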
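- New word discovery algorithm and synonym mining (☆27, Oct 24, 2017, updated 8 years ago)
- Popularity-based recommendation, content-based recommendation, user-based collaborative filtering recommendation, item-based collaborative filtering recommendation (☆43, Feb 2, 2019, updated 7 years ago)
- Corpus expansion by replacing synonyms in question-answering text using the Harbin Institute of Technology (HIT) synonym thesaurus (同义词林) (☆37, Jun 6, 2019, updated 6 years ago)
- Opinion mining of e-commerce reviews (☆44, Jan 29, 2021, updated 5 years ago)
- (☆37, Jan 5, 2018, updated 8 years ago)
- Cloud Native Distributed Nearest Neighbour Search (☆15, Jun 9, 2020, updated 5 years ago)
- Unsupervised Word Discovery (☆10, Jul 26, 2019, updated 6 years ago)
- Enable RNNLM lattice rescoring with Pytorch [kaldi] (☆12, Jun 5, 2020, updated 5 years ago)
- This repository contains PyTorch implementations of the models from the paper An Empirical Study MIME: MIMicking Emotions for Empathetic … (☆46, Mar 14, 2023, updated 3 years ago)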
- rabitq rust implementation (☆10, Feb 4, 2026, updated last month)
- Repository for Skill Set Optimization (☆14, Jul 26, 2024, updated last year)
- Code for EMNLP 2019 paper "A Boundary-aware Neural Model for Nested Named Entity Recognition" (☆89, Jan 24, 2022, updated 4 years ago)
- asyncio executors, clean and simple. (☆10, Nov 25, 2023, updated 2 years ago)
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI (☆12, Mar 31, 2022, updated 3 years ago)
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA. (☆14, Nov 17, 2023, updated 2 years ago)
- Latent Large Language Models (☆19, Aug 24, 2024, updated last year)