explosion/floret

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/explosion/floret)

explosion / floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

☆343

Alternatives and similar repositories for floret

Users that are interested in floret are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NorskRegnesentral / skweak
View on GitHub
skweak: A software toolkit for weak supervision applied to NLP tasks
☆925Sep 2, 2024Updated last year
pmbaumgartner / spacy-html-tokenizer
View on GitHub
☆69Mar 17, 2022Updated 4 years ago
explosion / spacy-experimental
View on GitHub
🧪 Cutting-edge experimental spaCy components and features
☆104Apr 23, 2024Updated 2 years ago
KennethEnevoldsen / augmenty
View on GitHub
Augmenty is an augmentation library based on spaCy for augmenting texts.
☆156May 24, 2024Updated 2 years ago
webis-de / small-text
View on GitHub
Active Learning for Text Classification in Python
☆646May 24, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
explosion / spacy-transformers
View on GitHub
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
☆1,408Mar 27, 2026Updated 3 months ago
erre-quadro / spikex
View on GitHub
SpikeX - SpaCy Pipes for Knowledge Extraction
☆403Jul 30, 2021Updated 4 years ago
davidberenstein1957 / concise-concepts
View on GitHub
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…
☆244Jun 19, 2023Updated 3 years ago
koaning / embetter
View on GitHub
just a bunch of useful embeddings for scikit-learn pipelines
☆527Feb 12, 2026Updated 5 months ago
davidberenstein1957 / classy-classification
View on GitHub
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…
☆221Jan 20, 2025Updated last year
koaning / doubtlab
View on GitHub
Doubt your data, find bad labels.
☆515Jul 15, 2024Updated 2 years ago
msg-systems / coreferee
View on GitHub
Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…
☆198Dec 18, 2022Updated 3 years ago
koaning / whatlies
View on GitHub
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
☆481Feb 6, 2023Updated 3 years ago
infinitylogesh / mutate
View on GitHub
A library to synthesize text datasets using Large Language Models (LLM)
☆152Jan 17, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
koaning / scikit-partial
View on GitHub
Pipeline components that support partial_fit.
☆46Jul 15, 2024Updated 2 years ago
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,772May 26, 2026Updated last month
KennethEnevoldsen / asent
View on GitHub
Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.
☆116Oct 20, 2025Updated 9 months ago
pmbaumgartner / spacy-setfit-textcat
View on GitHub
☆29Jun 23, 2022Updated 4 years ago
explosion / spacy-llm
View on GitHub
🦙 Integrating LLMs into structured NLP pipelines
☆1,392Mar 27, 2026Updated 3 months ago
explosion / sense2vec
View on GitHub
🦆 Contextually-keyed word vectors
☆1,678Mar 27, 2026Updated 3 months ago
R1j1t / contextualSpellCheck
View on GitHub
✔️Contextual word checker for better suggestions (not actively maintained)
☆419Jan 31, 2025Updated last year
explosion / spacy-streamlit
View on GitHub
👑 spaCy building blocks and visualizers for Streamlit apps
☆858Jul 29, 2024Updated last year
richardpaulhudson / holmes-extractor
View on GitHub
Information extraction from English and German texts based on predicate logic
☆144Jun 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gandersen101 / spaczz
View on GitHub
Fuzzy matching and more functionality for spaCy.
☆258Jul 6, 2024Updated 2 years ago
HazyResearch / bootleg
View on GitHub
Self-Supervision for Named Entity Disambiguation at the Tail
☆218Jun 14, 2022Updated 4 years ago
Lucaterre / spacyfishing
View on GitHub
A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata
☆173Nov 7, 2022Updated 3 years ago
koaning / icepickle
View on GitHub
It's a cooler way to store simple linear models.
☆26Jul 15, 2024Updated 2 years ago
avidale / compress-fasttext
View on GitHub
Tools for shrinking fastText models (in gensim format)
☆187May 3, 2024Updated 2 years ago
MaartenGr / BERTopic
View on GitHub
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
☆7,746May 13, 2026Updated 2 months ago
explosion / srsly
View on GitHub
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
☆484Mar 27, 2026Updated 3 months ago
HLasse / TextDescriptives
View on GitHub
A Python library for calculating a large variety of metrics from text
☆366May 5, 2026Updated 2 months ago
JulesBelveze / bert-squeeze
View on GitHub
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
☆85Feb 1, 2026Updated 5 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
kabirkhan / recon
View on GitHub
Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …
☆104Feb 26, 2024Updated 2 years ago
koaning / spacy-report
View on GitHub
Generate reports for spaCy models.
☆29May 27, 2022Updated 4 years ago
argilla-io / argilla
View on GitHub
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆5,038Updated this week
yash1994 / dframcy
View on GitHub
Dataframe Integration with spaCy.
☆103Mar 12, 2021Updated 5 years ago
ddangelov / Top2Vec
View on GitHub
Top2Vec learns jointly embedded topic, document and word vectors.
☆3,104Nov 14, 2024Updated last year
jenojp / negspacy
View on GitHub
spaCy pipeline object for negating concepts in text
☆280Apr 20, 2026Updated 3 months ago
explosion / catalogue
View on GitHub
Super lightweight function registries for your library
☆183Mar 27, 2026Updated 3 months ago