Robust and fast topic models with sentence-transformers.
☆97Apr 4, 2026Updated this week
Alternatives and similar repositories for turftopic
Users that are interested in turftopic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Powerful topic model visualization in Python☆145Mar 19, 2025Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆43Sep 6, 2024Updated last year
- ☆10Jun 23, 2023Updated 2 years ago
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆19Jun 17, 2025Updated 9 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 4 months ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆150Jul 29, 2025Updated 8 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Blazing fast fuzzy text search for Python.☆51Apr 19, 2025Updated 11 months ago
- ☆12Apr 9, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆19Nov 21, 2024Updated last year
- Fast Multimodal Semantic Deduplication & Filtering☆909Jan 20, 2026Updated 2 months ago
- Embedding Vector Oriented Clustering☆212Apr 1, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 10 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 10 months ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆65Aug 6, 2025Updated 8 months ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Late Interaction Models Training & Retrieval☆778Mar 6, 2026Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆80Jan 6, 2026Updated 3 months ago
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Python library for calculating a large variety of metrics from text☆361Mar 20, 2026Updated 2 weeks ago
- The robust European language model benchmark.☆168Updated this week
- Mastering spaCy, Second Edition published by Packt☆24Feb 4, 2025Updated last year
- Pre-train Static Word Embeddings☆98Mar 27, 2026Updated last week
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Mar 1, 2026Updated last month
- Lightweight Nearest Neighbors with Flexible Backends☆336Updated this week
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 11 months ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions☆15Jun 16, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆115Sep 19, 2025Updated 6 months ago
- A data validation tool for MARC records☆27Mar 11, 2026Updated 3 weeks ago
- Creating beautiful plots of data maps☆993Mar 24, 2026Updated 2 weeks ago
- Bayesian probability transforms for BM25 retrieval scores☆72Mar 28, 2026Updated last week
- Fuzzy search modules for searching lists of words in low quality OCR and HTR text.☆23Mar 30, 2026Updated last week
- Shows how to encrypt data held in public space☆11Aug 11, 2017Updated 8 years ago
- Toolbox for non-linear calibration modeling.☆29Mar 30, 2026Updated last week