Robust and fast topic models with sentence-transformers.
☆94Feb 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for turftopic
Users that are interested in turftopic are comparing it to the libraries listed below
Sorting:
- Powerful topic model visualization in Python☆144Mar 19, 2025Updated 11 months ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)☆142Jul 29, 2025Updated 7 months ago
- Tools for interactive visual exploration of semantic embeddings.☆42Sep 6, 2024Updated last year
- DImensionality REduction in JAX☆25Nov 21, 2025Updated 3 months ago
- Converting irregularly spaced time series, such as eletronic health records, into dataframes for tabular classification.☆19Jun 17, 2025Updated 8 months ago
- Zero/few shot learning components for scikit-learn pipelines with LLMs and transformers.☆19Nov 21, 2024Updated last year
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Apr 15, 2024Updated last year
- FlexiTokens☆18Dec 27, 2025Updated 2 months ago
- Fast Multimodal Semantic Deduplication & Filtering☆890Jan 20, 2026Updated last month
- Blazing fast fuzzy text search for Python.☆51Apr 19, 2025Updated 10 months ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 9 months ago
- Embedding Vector Oriented Clustering☆173Feb 4, 2026Updated 3 weeks ago
- Implemented transformer NN block for Machine translation, text classfication, Natural language inference as well as Machine reading compr…☆11Dec 27, 2025Updated 2 months ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- Algorithms for generating synthetic data☆16Jun 18, 2024Updated last year
- python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data☆19Aug 15, 2024Updated last year
- Mastering spaCy, Second Edition published by Packt☆23Feb 4, 2025Updated last year
- ☆26Nov 18, 2025Updated 3 months ago
- Late Interaction Models Training & Retrieval☆721Feb 18, 2026Updated last week
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆114Sep 19, 2025Updated 5 months ago
- Simple scripts to generate and use an Annoy index and lmdb map☆28Jan 4, 2018Updated 8 years ago
- ☆107Jun 2, 2025Updated 8 months ago
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)☆52Apr 23, 2025Updated 10 months ago
- HDBSCAN Tuning for BERTopic Models☆52Jun 5, 2023Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆140Dec 28, 2024Updated last year
- The robust European language model benchmark.☆161Updated this week
- Efficient BM25 with DuckDB 🦆☆64Dec 20, 2024Updated last year
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Sep 24, 2022Updated 3 years ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 5 months ago
- ☆47Feb 7, 2024Updated 2 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆36Jan 5, 2026Updated last month
- Lightweight Nearest Neighbors with Flexible Backends☆334Dec 30, 2025Updated 2 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆63Jul 6, 2025Updated 7 months ago
- ☆20May 29, 2016Updated 9 years ago
- Concept Modeling: Topic Modeling on Images and Text☆220Nov 4, 2024Updated last year