Robust and fast topic models with sentence-transformers.
☆111Apr 13, 2026Updated last month
Alternatives and similar repositories for turftopic
Users that are interested in turftopic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Powerful topic model visualization in Python☆148Mar 19, 2025Updated last year
- ☆10Jun 23, 2023Updated 2 years ago
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆20Jun 17, 2025Updated 11 months ago
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 5 months ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆156Jul 29, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Blazing fast fuzzy text search for Python.☆52Apr 19, 2025Updated last year
- KalDB is a cloud-native polystore for search and analytics☆39May 8, 2026Updated last week
- ☆12Apr 9, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156May 24, 2024Updated last year
- A Scandinavian Benchmark for sentence embeddings☆45Dec 5, 2025Updated 5 months ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆19Nov 21, 2024Updated last year
- Fast Multimodal Semantic Deduplication & Filtering☆926May 4, 2026Updated 2 weeks ago
- Bias correction for richness in abundance data☆12Apr 20, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated 11 months ago
- ☆29Nov 18, 2025Updated 6 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 11 months ago
- HDBSCAN Tuning for BERTopic Models☆52Jun 5, 2023Updated 2 years ago
- ANE accelerated embedding models!☆19Dec 11, 2024Updated last year
- State-of-the-art paired encoder and decoder models (17M-1B params)☆69Aug 6, 2025Updated 9 months ago
- Late Interaction Models Training & Retrieval☆811May 11, 2026Updated last week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- FlexiTokens☆22Dec 27, 2025Updated 4 months ago
- LLM plugin for embeddings using sentence-transformers☆73Apr 23, 2025Updated last year
- A Python library for calculating a large variety of metrics from text☆363May 5, 2026Updated 2 weeks ago
- The robust European language model benchmark.☆176Updated this week
- Mastering spaCy, Second Edition published by Packt☆24Feb 4, 2025Updated last year
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Mar 1, 2026Updated 2 months ago
- Pre-train Static Word Embeddings☆101May 4, 2026Updated 2 weeks ago
- Lightweight Nearest Neighbors with Flexible Backends☆341May 4, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆49Jan 7, 2021Updated 5 years ago
- A repository containing the materials required to complete the "AAAI Lab for Innovative Uses of Synthetic Data". This includes tutorials …☆12Sep 4, 2024Updated last year
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions☆16Jun 16, 2018Updated 7 years ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆117Sep 19, 2025Updated 8 months ago
- A Rust library for accessing a Python AST using the Python ast library.☆16Apr 30, 2026Updated 2 weeks ago
- Creating beautiful plots of data maps☆1,008Updated this week