Topic Modelling for Humans
β16,373Nov 1, 2025Updated 4 months ago
Alternatives and similar repositories for gensim
Users that are interested in gensim are comparing it to the libraries listed below
Sorting:
- π« Industrial-strength Natural Language Processing (NLP) in Pythonβ33,283Updated this week
- Library for fast text representation and classification.β26,501Mar 22, 2024Updated last year
- NLTK Sourceβ14,539Updated this week
- An open-source NLP research library, built on PyTorch.β11,891Nov 22, 2022Updated 3 years ago
- Deep Learning for humansβ63,869Mar 3, 2026Updated last week
- A very simple framework for state-of-the-art Natural Language Processing (NLP)β14,355Oct 27, 2025Updated 4 months ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on singβ¦β28,066Updated this week
- TensorFlow code and pre-trained models for BERTβ39,879Jul 23, 2024Updated last year
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.β9,514Updated this week
- Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddingsβ7,192Jul 27, 2025Updated 7 months ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the moβ¦β22,981Jul 28, 2024Updated last year
- A library for efficient similarity search and clustering of dense vectors.β39,255Updated this week
- scikit-learn: machine learning in Pythonβ65,341Updated this week
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to diskβ14,175Oct 29, 2025Updated 4 months ago
- Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.β8,858Jun 10, 2024Updated last year
- Models and examples built with TensorFlowβ77,691Updated this week
- CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.β10,059Feb 10, 2026Updated 3 weeks ago
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β157,462Updated this week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used β¦β18,124Mar 3, 2026Updated last week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,176Sep 30, 2025Updated 5 months ago
- β3,171Nov 16, 2021Updated 4 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP)β18,237Feb 7, 2026Updated last month
- State-of-the-Art Text Embeddingsβ18,364Updated this week
- Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languagesβ7,733Updated this week
- π Scalable embedding, reasoning, ranking for images and sentences with CLIPβ12,820Jan 23, 2024Updated 2 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juliβ¦β20,828Oct 25, 2023Updated 2 years ago
- Unsupervised text tokenizer for Neural Network-based text generation.β11,677Mar 1, 2026Updated last week
- The fastai deep learning libraryβ27,887Feb 26, 2026Updated last week
- A natural language modeling framework based on PyTorchβ6,305Oct 17, 2022Updated 3 years ago
- Parallel computing with task schedulingβ13,760Updated this week
- Oxford Deep NLP 2017 courseβ15,859Jul 2, 2023Updated 2 years ago
- A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learningβ7,087Feb 2, 2026Updated last month
- An Open Source Machine Learning Framework for Everyoneβ193,952Updated this week
- Lime: Explaining the predictions of any machine learning classifierβ12,099Jul 25, 2024Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU accelerationβ98,039Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,617Updated this week
- A system for quickly generating training data with weak supervisionβ5,941May 2, 2024Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package.β1,846Dec 4, 2025Updated 3 months ago
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arraysβ¦β9,984Jan 15, 2024Updated 2 years ago