Robust and fast topic models with sentence-transformers.
☆95Mar 16, 2026Updated this week
Alternatives and similar repositories for turftopic
Users that are interested in turftopic are comparing it to the libraries listed below
Sorting:
- Blazing fast topic modelling for short texts.☆36Jan 5, 2026Updated 2 months ago
- Powerful topic model visualization in Python☆145Mar 19, 2025Updated last year
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆19Jun 17, 2025Updated 9 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 3 months ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆147Jul 29, 2025Updated 7 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Blazing fast fuzzy text search for Python.☆51Apr 19, 2025Updated 11 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- A Scandinavian Benchmark for sentence embeddings☆46Dec 5, 2025Updated 3 months ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆19Nov 21, 2024Updated last year
- Embedding Vector Oriented Clustering☆177Feb 26, 2026Updated 3 weeks ago
- Fast Multimodal Semantic Deduplication & Filtering☆897Jan 20, 2026Updated 2 months ago
- Bias correction for richness in abundance data☆12Aug 18, 2025Updated 7 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- ☆26Nov 18, 2025Updated 4 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- HDBSCAN Tuning for BERTopic Models☆52Jun 5, 2023Updated 2 years ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- User-friendly viewer for Parquet files☆10Mar 7, 2026Updated last week
- Late Interaction Models Training & Retrieval☆743Mar 6, 2026Updated 2 weeks ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- The website for Danish Foundation Models, a project for training foundational Danish language model.☆81Jan 6, 2026Updated 2 months ago
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- LLM plugin for embeddings using sentence-transformers☆74Apr 23, 2025Updated 10 months ago
- A Python library for calculating a large variety of metrics from text☆361Jan 30, 2026Updated last month
- The robust European language model benchmark.☆164Updated this week
- Pre-train Static Word Embeddings☆95Sep 9, 2025Updated 6 months ago
- Mastering spaCy, Second Edition published by Packt☆23Feb 4, 2025Updated last year
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Mar 1, 2026Updated 2 weeks ago
- Lightweight Nearest Neighbors with Flexible Backends☆335Dec 30, 2025Updated 2 months ago
- A Rust library for accessing a Python AST using the Python ast library.☆15Aug 6, 2025Updated 7 months ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- DreamBank Visualized - An interactive visualization of over 26,000 dream transcriptions☆15Jun 16, 2018Updated 7 years ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆114Sep 19, 2025Updated 6 months ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- ☆107Jun 2, 2025Updated 9 months ago
- Bayesian probability transforms for BM25 retrieval scores☆58Updated this week