Robust and fast topic models with sentence-transformers.
☆109Apr 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for turftopic
Users that are interested in turftopic are comparing it to the libraries listed below.
- Blazing fast topic modelling for short texts.☆36Apr 6, 2026Updated 3 weeks ago
- Powerful topic model visualization in Python☆146Mar 19, 2025Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆44Sep 6, 2024Updated last year
- Converting irregularly spaced time series, such as electronic health records, into dataframes for tabular classification.☆20Jun 17, 2025Updated 10 months ago
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 5 months ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆153Jul 29, 2025Updated 9 months ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated last year
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆19Nov 21, 2024Updated last year
- Fast Multimodal Semantic Deduplication & Filtering☆915Jan 20, 2026Updated 3 months ago
- Bias correction for richness in abundance data☆12Apr 20, 2026Updated last week
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 11 months ago
- ☆29Nov 18, 2025Updated 5 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 11 months ago
- HDBSCAN Tuning for BERTopic Models☆52Jun 5, 2023Updated 2 years ago
- A Python library for graph coloring☆15Feb 13, 2026Updated 2 months ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Late Interaction Models Training & Retrieval☆796Updated this week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- The website for Danish Foundation Models, a project for training foundational Danish language models.☆80Apr 20, 2026Updated last week
- FlexiTokens☆20Dec 27, 2025Updated 4 months ago
- A Python library for calculating a large variety of metrics from text☆363Mar 20, 2026Updated last month
- The robust European language model benchmark.☆175Apr 17, 2026Updated last week
- Pre-train Static Word Embeddings☆99Mar 27, 2026Updated last month
- Lightweight Nearest Neighbors with Flexible Backends☆336Apr 16, 2026Updated last week
- A repository containing the materials required to complete the "AAAI Lab for Innovative Uses of Synthetic Data". This includes tutorials …☆12Sep 4, 2024Updated last year
- TopicGPT allows integrating the benefits of LLMs into Topic Modelling☆115Sep 19, 2025Updated 7 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- A French litbank corpus☆10Jan 22, 2026Updated 3 months ago
- Shows how to encrypt data held in public space☆11Aug 11, 2017Updated 8 years ago
- Toolbox for non-linear calibration modeling.☆29Updated this week
- ☆24Oct 18, 2023Updated 2 years ago
- Algorithms for generating synthetic data☆17Jun 18, 2024Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆140Dec 28, 2024Updated last year
- 14 million, semi-supervised, mental disorder detection data.☆15Oct 23, 2024Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 11 months ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.☆7,564Feb 20, 2026Updated 2 months ago
- ☆33Updated this week