Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allocation (LDA), hyperparameters grid search and Topic Modeling visualiation.
☆44Jun 18, 2019Updated 6 years ago
Alternatives and similar repositories for topic-modelling
Users that are interested in topic-modelling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Computational Social Science Lab introducing basic concepts, research design, and tools for text analysis.☆16Sep 13, 2024Updated last year
- A collection of notebooks for Natural Language Processing☆25Jan 13, 2025Updated last year
- Transform a corpus of text documents (any kind) into a map with different zoom levels and topics names to summarise sub corpus of similar…☆29Jan 1, 2024Updated 2 years ago
- This is the repository for the files and documents used in the Smart Literature Review paper from (Boye, Møller, 2019)☆21May 16, 2022Updated 3 years ago
- In this notebook i will be demonstarting Latent Dirchlet Allocation(LDA) for topic modelling. I will be using the Amazon fine food review…☆46Jun 9, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Jan 26, 2022Updated 4 years ago
- The official source code for TaleBrush (CHI 2022)☆15Jul 13, 2022Updated 3 years ago
- ☆19May 13, 2022Updated 3 years ago
- Colab, MLflow and papermill are individually great. Together they form a dream team.☆10Jun 9, 2020Updated 5 years ago
- The Science knowledge graph ontologies, a.k.a. SKGO, is a suite of OWL ontology models to capture the knowledge of scientific research da…☆16Jul 3, 2025Updated 8 months ago
- Senior A.I. project to generate realistic news articles like those found on CNN, NYTimes, Fox News, etc. Future research will involve con…☆15Apr 26, 2019Updated 6 years ago
- Fragments-Expert is a software package for feature extraction from file fragments and classification among various file formats.☆13Jan 16, 2024Updated 2 years ago
- ☆23Jan 9, 2021Updated 5 years ago
- Converting the Enron email collection to mbox format☆11Dec 9, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reference list of email processing resources; focus on preservation and PII handling☆14Apr 20, 2022Updated 3 years ago
- a Mini App that provides a demo flow for request fictional rides (motorcycle, car, helicopter). User is presented with a map where they c…☆12Aug 6, 2024Updated last year
- ☆28Feb 18, 2018Updated 8 years ago
- Code Repository for Bash Scripting and Shell Programming (Linux Command Line), Published by Packt☆12Jan 30, 2023Updated 3 years ago
- Range facet/limit/profile plugin for Blacklight☆22Feb 6, 2026Updated last month
- ☆11Dec 2, 2024Updated last year
- This is an introduction to Chinese words segmentation using Jieba.☆14May 31, 2018Updated 7 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- A python package to explore pathways, diseases and drugs associated to a list of targets (genes, proteins, etc)☆20Mar 4, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Sample notebooks for using the Global Database of Events, Language and Tone (GDELT).☆19Nov 8, 2020Updated 5 years ago
- Exploring Jaccard and Cosine similarities performances then visualising their output using k means and kmeans with pca. Additional input …☆14Jan 26, 2021Updated 5 years ago
- Learning structural topic modeling using the stm R package.☆135Sep 23, 2017Updated 8 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- Working paper and notebook for unsupervised document clustering☆13Mar 6, 2018Updated 8 years ago
- Creation of LDA (Latent Dirichlet Allocation) Topic Model on corpus of books harvested from Project Gutenberg☆27Apr 5, 2018Updated 7 years ago
- ☆30Jun 23, 2022Updated 3 years ago
- Prolog versions of the WordNet databases☆31Mar 9, 2026Updated 2 weeks ago
- ☆14Sep 27, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Python wrapper around the topic modeling functions of MALLET.☆106Nov 1, 2024Updated last year
- ParlaMint: Comparable Parliamentary Corpora☆77Nov 2, 2025Updated 4 months ago
- Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa…☆26Nov 10, 2021Updated 4 years ago
- Harmonizing pathway databases using Biological Expression Language (BEL)☆20Jul 1, 2024Updated last year
- A Python implementation for training a neural network for predicting drug-protein interactions using Keras and Tensorflow☆18Jul 9, 2018Updated 7 years ago
- Ready to use blueprint project for creating and deploying a soulbound NFT collection contract on the TON blockchain using Tact programmin…☆12Sep 30, 2024Updated last year
- Using scrapy for meetups.com☆11Oct 7, 2015Updated 10 years ago