提出基于划分的LDA主题模型 (PLDA)。对传统LDA模型进行改进,考虑中长篇文档篇章结构较为清晰,传统LDA在处理中长篇文档时不能识别每个篇章的主题,提出基于划分的LDA主题模型,对中长篇文档如新闻报道】国务院工作报告等按照段落进行划分,先拆后合,并将其效果与传统LDA、LSI及doc2vec进行比较。基于Sougou和Fudan语料库的分类实验验证了PLDA效果最优。
☆42Jul 8, 2019Updated 6 years ago
Alternatives and similar repositories for LDA-based-on-partition-PLDA-
Users that are interested in LDA-based-on-partition-PLDA- are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于python gensim 库的LDA算法 对中文进行文本分析,很难得,网上都是英文的,基本上没有中文的,需要安装jieba分词进行分词,然后去除停用词最后才能使用LDA☆138Sep 7, 2019Updated 6 years ago
- 以聚类算法、LDA主题模型、分类器为基础,完成对Twitter语料的基于地理位置的主题事件挖掘,并对主题事件进行细粒度的情绪分析☆35Jul 29, 2018Updated 7 years ago
- 复现了论文《基于主题模型的短文本关键词抽取及扩展》的代码☆31Nov 11, 2020Updated 5 years ago
- 微信朋友圈LDA建模发现最常讨论的主题☆13May 4, 2016Updated 10 years ago
- Topic Analysis of Weibo News Based on Title-Weighted LDA Model ;基于标题加权的LDA模型的微博新闻主题分析;タイトル加重LDAモデルに基づくWeiboニューストピック分析☆15Oct 12, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- AI100竞赛:http://competition.ai100.com.cn/html/game_det.html?id = 24&tab = 1 的代码,主要用于文本分类,其中涉及CHI选择特征词,TFIDF计算权重,朴素贝叶斯,决策树,SVM,XGBoost等算法☆15Mar 27, 2019Updated 7 years ago
- lda模型的python实现☆31Aug 11, 2015Updated 10 years ago
- 实现功能:新输入一段文本,与已有数据进行相似度进行比较,返回TOP10的文本。主要实现方法:jieba中文分词、gensim、TF-IDF词汇重要性、cosine余弦相似度。☆11Jul 30, 2020Updated 5 years ago
- SEBERTNets:一种面向金融领域的事件主体抽取方法☆194Jun 4, 2022Updated 3 years ago
- 基于语义的文本相似度计算☆10Jan 22, 2019Updated 7 years ago
- textsum基于tensorflow实现的Seq2Seq-attention模型以及其他策略算法, 来解决摘要生成、主旨提取等(Text Summary)的任务。部分代码是在其他作者代码的基础上修改而来,后期将全部整理重构。☆30Sep 19, 2019Updated 6 years ago
- 用gensim训练LDA模型,进行新闻文本主题分析☆79May 17, 2019Updated 6 years ago
- Open web page extractor and keyword extractor for Chinese web pages☆20Aug 19, 2019Updated 6 years ago
- 无监督观点聚类。通过依存关系进行观点提取,对观点进行相似度计算,对已经生成的观点聚类☆47Jan 15, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- cropper js基于vue裁剪上传图片 功能☆13Nov 28, 2017Updated 8 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- 基于正则表达式和AC自动机多模匹配进行不规则处方文本理解,识别药品名、给药总量、用法用量等目标内容。☆15Jul 7, 2019Updated 6 years ago
- CADGMM (PAKDD2020)☆11Jan 2, 2021Updated 5 years ago
- 使用django对情感分析功能进行封装,里面包含使用情感词典和Bert模型进行情感分类,最后可以使用tensorFlow serving将模型部署在docker中运行。☆12Sep 23, 2019Updated 6 years ago
- This is source code of our Master Thesis on "Decode and Forward Relay Assisting Active Jamming in NOMA system".☆12Feb 26, 2024Updated 2 years ago
- CCKS 2020:面向金融领域的小样本跨类迁移事件抽取。该项目实现基于MRC的事件抽取方法☆39Oct 27, 2022Updated 3 years ago
- collection of pyTorch network structure☆11May 29, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 文本分类之特征选择☆11Aug 12, 2017Updated 8 years ago
- Collapsed Gibbs sampling for Latent Dirichlet Allocation☆18Jun 11, 2012Updated 13 years ago
- AI Challenger 2018 阅读理解赛道代码分享☆20Dec 6, 2018Updated 7 years ago
- Algorithms re-implementation for paper "Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approa…☆12Jan 7, 2023Updated 3 years ago
- 领域自适应文本挖掘工具(新词发现、情感分析、实体链接等),基于少量种子词和背景知识☆13Jun 19, 2019Updated 6 years ago
- 微调阿里开源的文字检测模型,利用合合识别返回的OCR结果作为初始训练数据,对模型进行优化训练,使其更加适应1万张图片的具体场景,提高文字识别的精度。☆10Dec 9, 2024Updated last year
- ☆12Dec 6, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- 该库是一个项目集,包括文本分类、多标签分类、细粒度情感分析、命名实体识别,以及部分数据集等☆202Mar 6, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Retrieving 'Topics' (concept) from corpus using (1) Latent Dirichlet Allocation (Genism) for modelling. Perplexity and Coherence score we…☆12Nov 2, 2018Updated 7 years ago
- ☆11Nov 12, 2024Updated last year
- 使用Python进行自然语言处理相关实践,如新词发现,主题模型,隐马尔模型词性标注,Word2Vec,情感分析☆52Jan 8, 2020Updated 6 years ago
- Codes, datasets, and features for Dynamic Collaborative Filtering with Aesthetic Feature (DCFA)☆10Nov 21, 2018Updated 7 years ago
- Code base for my master's thesis.☆19Feb 7, 2019Updated 7 years ago
- FEVER Workshop Shared-Task☆16Apr 16, 2019Updated 7 years ago
- Here present the Outage Probability, Achievable Rate of each user and Sum Rate of user 2*2 MIMO and 3*3 MIMO☆11Mar 18, 2023Updated 3 years ago