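PMI (pointwise mutual information) is the pointwise form of mutual information (MI). Mutual information is a concept from information theory, mainly used to measure how strongly two signals are associated. PMI is used in text processing to compute the degree of association between two words. Compared with traditional similarity measures, the advantage of PMI is that it uses word co-occurrence statistics to determine whether two words are semantically or topically related.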
☆15 · Aug 24, 2020 · Updated 5 years ago
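As a rough illustration of the idea described above (not the repository's actual code), PMI for a word pair can be estimated as log2(p(w1, w2) / (p(w1) · p(w2))). The sketch below assumes document-level co-occurrence (two words co-occur if they appear in the same document) and uses a tiny made-up corpus. The function name `pmi_scores` and the sample data are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_scores(docs):
    """Return {(w1, w2): PMI} from document-level co-occurrence counts.

    Assumption: probabilities are estimated as document frequencies,
    i.e. p(w) = (#docs containing w) / (#docs).
    """
    n_docs = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words
    for tokens in docs:
        vocab = sorted(set(tokens))  # sorted so each pair has one canonical order
        word_df.update(vocab)
        pair_df.update(combinations(vocab, 2))

    scores = {}
    for (w1, w2), n12 in pair_df.items():
        # PMI = log2( p(w1, w2) / (p(w1) * p(w2)) )
        scores[(w1, w2)] = math.log2(n12 * n_docs / (word_df[w1] * word_df[w2]))
    return scores


# Hypothetical toy corpus, already tokenized.
docs = [
    ["machine", "learning", "model"],
    ["machine", "learning", "data"],
    ["deep", "learning", "model"],
    ["data", "cleaning"],
]
scores = pmi_scores(docs)
for pair, value in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(pair, round(value, 3))
```

A positive score means the pair co-occurs more often than independence would predict, a score near zero means roughly independent, and a negative score means the words tend to avoid each other. Pairs that never co-occur are simply absent from the result. In practice, low-frequency pairs tend to get inflated PMI values, so a minimum count threshold is usually applied.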
Alternatives and similar repositories for PMI
Users that are interested in PMI are comparing it to the libraries listed below.
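- Code optimized for faster inference, reproducing the RE2 model on a Chinese corpus ☆15 · Mar 24, 2023 · Updated 3 years ago
- NER-based text error correction ☆15 · Dec 27, 2023 · Updated 2 years ago
- BERT-based Chinese text error correction ☆242 · Jun 12, 2023 · Updated 2 years ago
- ☆25 · Mar 6, 2016 · Updated 10 years ago
- Training word vectors with word2vec and fastText ☆11 · Jan 10, 2019 · Updated 7 years ago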
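- Something Temporary ☆10 · Oct 18, 2018 · Updated 7 years ago
- Housing listing recommendation system based on content similarity ☆12 · Jul 4, 2021 · Updated 4 years ago
- Uses the open-source Bert-as-Service to generate pretrained document feature vectors, clusters COVID-19 literature with k-means, visualizes the data with t-SNE, generates topic keywords for each cluster with LDA, and draws Bokeh plots for searching and filtering the data by cluster and keyword. ☆19 · Aug 3, 2020 · Updated 5 years ago
- Visual comparison of word vectors trained with word2vec and word vectors generated by BERT ☆15 · Jun 29, 2020 · Updated 5 years ago
- Code for NLPCC2016 Chinese Word Similarity Task ☆17 · Sep 8, 2016 · Updated 9 years ago
- Self complemented Word Collocation using MI method which is tested to be effective. Word collocation extraction based on the mutual information algorithm ☆28 · Apr 5, 2018 · Updated 8 years ago
- Training a word2vec model with gensim and clustering the resulting word vectors ☆16 · Sep 23, 2017 · Updated 8 years ago
- ☆19 · Feb 9, 2021 · Updated 5 years ago
- Summary of text classification techniques in deep learning, implemented in PyTorch and TensorFlow. ☆14 · Dec 18, 2019 · Updated 6 years ago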
- Pre-train Chinese GloVe word embeddings from the Chinese Wikipedia corpus ☆80 · Oct 13, 2023 · Updated 2 years ago
- [Demo] Three methods for finding synonyms ☆27 · Sep 21, 2020 · Updated 5 years ago
- ☆22 · Jul 2, 2021 · Updated 4 years ago
- Data preprocessing for question equivalence judgment, including adding adversarial samples (homophone substitution, synonym substitution, etc.) and extracting sample patterns (replacing shared words with wildcards, extracting shared and differing words) ☆39 · Dec 23, 2019 · Updated 6 years ago
- ☆15 · Sep 2, 2017 · Updated 8 years ago
- ☆32 · Sep 6, 2023 · Updated 2 years ago
- Hermes is a library built on top of TensorFlow 2 designed to provide simple, abstractions for natural language processing utilizing end t… ☆18 · Jun 5, 2021 · Updated 4 years ago
- My implementation of 《Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory》 ☆34 · Oct 15, 2019 · Updated 6 years ago
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀" (Text Scalpel). Currently, this project implements a text paraphrasing task, used for data augmentation of NLP corpora. ☆216 · Mar 24, 2023 · Updated 3 years ago
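- Personalized recommendation models, mainly including als, als_wr, biaslfm, lfm, nmf, svdpp, content-based, content-based regression, user-cf, item-cf, slopeone, association rules, and hybrids of content-based and cf models. ☆33 · Sep 11, 2022 · Updated 3 years ago
- Multimodal BERT for the Chinese domain ☆47 · Mar 24, 2020 · Updated 6 years ago
- Simple Chinese error correction and disambiguation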