WZBSocialScienceCenter/tmtoolkit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WZBSocialScienceCenter/tmtoolkit)

WZBSocialScienceCenter / tmtoolkit

Text Mining and Topic Modeling Toolkit for Python with parallel processing power

☆191

Alternatives and similar repositories for tmtoolkit

Users that are interested in tmtoolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

olgasilyutina / stm_internet_regulation
View on GitHub
Analysis of Russian mass media articles about internet regulation with structural topic modeling
☆11May 15, 2018Updated 8 years ago
XinwenNI / LDA-DTM
View on GitHub
Latent Drichlet Allocation and Dynamic Topic Modeling
☆10Aug 11, 2021Updated 4 years ago
llefebure / un-general-debates
View on GitHub
Analysis and experiments on the UN General Debate corpus
☆37Apr 10, 2019Updated 7 years ago
yya518 / sparse-constrained-lda
View on GitHub
☆15Aug 22, 2016Updated 9 years ago
word-fish / wordfish-python
View on GitHub
extract relationships from standardized terms from corpus of interest with deep learning
☆19Dec 31, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
VaradPathak / DynamicLDA
View on GitHub
Dynamic Topic Modeling and Topic Chains of Reuters News Articles using SCVB0
☆24Jan 12, 2017Updated 9 years ago
wesslen / NCStateSenateFacebook
View on GitHub
Structural Topic Modeling of the Facebook posts of NC State Senators
☆13Mar 17, 2017Updated 9 years ago
adjidieng / ETM
View on GitHub
Topic Modeling in Embedding Spaces
☆561Oct 3, 2023Updated 2 years ago
chartbeat-labs / textacy
View on GitHub
NLP, before and after spaCy
☆2,239Sep 22, 2023Updated 2 years ago
Computational-Content-Analysis-2020 / Content-Analysis-2020
View on GitHub
Jupyter Notebooks to follow for each week
☆37Jun 12, 2020Updated 6 years ago
nikita-moor / ldatuning
View on GitHub
LDA models parameters tuning
☆78May 31, 2024Updated 2 years ago
n-waves / ulmfit4de
View on GitHub
ULMFiT Method for German Language
☆15May 10, 2019Updated 7 years ago
stephenhky / PyShortTextCategorization
View on GitHub
Various Algorithms for Short Text Mining
☆471Updated this week
BenjaminDHorne / The-NELA-Toolkit
View on GitHub
The News Landscape Toolkit (NELA)
☆16Oct 14, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
erre-quadro / spikex
View on GitHub
SpikeX - SpaCy Pipes for Knowledge Extraction
☆403Jul 30, 2021Updated 4 years ago
explosion / projects
View on GitHub
🪐 End-to-end NLP workflows from prototype to production
☆1,432Oct 15, 2024Updated last year
koaning / whatlies
View on GitHub
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
☆481Feb 6, 2023Updated 3 years ago
dkaslovsky / ElasticBatch
View on GitHub
Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames
☆21Dec 19, 2019Updated 6 years ago
MaartenGr / PolyFuzz
View on GitHub
Fuzzy string matching, grouping, and evaluation.
☆801Jul 10, 2025Updated last year
btwael / superstring.py
View on GitHub
A fast and memory-optimized string library for heavy-text manipulation in Python
☆251Apr 22, 2020Updated 6 years ago
argilla-io / spacy-wordnet
View on GitHub
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
☆261Aug 21, 2025Updated 11 months ago
jenojp / negspacy
View on GitHub
spaCy pipeline object for negating concepts in text
☆280Apr 20, 2026Updated 3 months ago
bab2min / tomotopy
View on GitHub
Python package of Tomoto, the Topic Modeling Tool
☆597Feb 21, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kevinlu1248 / pyate
View on GitHub
PYthon Automated Term Extraction
☆318Feb 8, 2023Updated 3 years ago
joewandy / hlda
View on GitHub
Gibbs sampler for the Hierarchical Latent Dirichlet Allocation topic model
☆153Feb 11, 2026Updated 5 months ago
jboynyc / textnets
View on GitHub
Text analysis with networks.
☆294May 14, 2026Updated 2 months ago
nfriedri / annie-annotation-platform
View on GitHub
☆31Apr 2, 2022Updated 4 years ago
machine-intelligence-laboratory / OptimalNumberOfTopics
View on GitHub
A set of methods for finding an appropriate number of topics in a text collection
☆15Apr 13, 2026Updated 3 months ago
MIND-Lab / OCTIS
View on GitHub
OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)
☆803Jun 21, 2026Updated last month
ddangelov / Top2Vec
View on GitHub
Top2Vec learns jointly embedded topic, document and word vectors.
☆3,101Nov 14, 2024Updated last year
inpho / topic-explorer
View on GitHub
System for building, visualizing, and working with LDA topic models
☆98Jan 22, 2026Updated 6 months ago
msg-systems / holmes-extractor
View on GitHub
Information extraction from English and German texts based on predicate logic
☆395Jul 8, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hyperquest-hq / hyperbase
View on GitHub
A foundational library for Semantic Hypergraphs
☆643Updated this week
lihait / CollocationFinder
View on GitHub
基于WordNet和句法依存树，实现对英语短语的搭配提取，包括连续的和非连续的英语短语词组。
☆10Jul 2, 2016Updated 10 years ago
lgalke / vec4ir
View on GitHub
Word Embeddings for Information Retrieval
☆227Oct 4, 2023Updated 2 years ago
piskvorky / gensim
View on GitHub
Topic Modelling for Humans
☆16,474Nov 1, 2025Updated 8 months ago
JasonKessler / scattertext
View on GitHub
Beautiful visualizations of how language differs among document types.
☆2,337Jul 4, 2026Updated 3 weeks ago
MilaNLProc / contextualized-topic-models
View on GitHub
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…
☆1,271Jul 24, 2025Updated last year
MartinoMensio / spacy-dbpedia-spotlight
View on GitHub
A spaCy wrapper for DBpedia Spotlight
☆110Mar 24, 2023Updated 3 years ago