πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
β341Apr 25, 2025Updated last year
Alternatives and similar repositories for floret
Users that are interested in floret are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- skweak: A software toolkit for weak supervision applied to NLP tasksβ926Sep 2, 2024Updated last year
- β69Mar 17, 2022Updated 4 years ago
- π§ͺ Cutting-edge experimental spaCy components and featuresβ105Apr 23, 2024Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β156May 24, 2024Updated 2 years ago
- Active Learning for Text Classification in Pythonβ643May 24, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β244Jun 19, 2023Updated 2 years ago
- πΈ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyβ1,407Mar 27, 2026Updated 2 months ago
- just a bunch of useful embeddings for scikit-learn pipelinesβ526Feb 12, 2026Updated 3 months ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ403Jul 30, 2021Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-sβ¦β221Jan 20, 2025Updated last year
- Doubt your data, find bad labels.β516Jul 15, 2024Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further langβ¦β198Dec 18, 2022Updated 3 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!β480Feb 6, 2023Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)β152Jan 17, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pipeline components that support partial_fit.β46Jul 15, 2024Updated last year
- Efficient few-shot learning with Sentence Transformersβ2,743May 26, 2026Updated 2 weeks ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.β117Oct 20, 2025Updated 7 months ago
- β29Jun 23, 2022Updated 3 years ago
- π¦ Integrating LLMs into structured NLP pipelinesβ1,392Mar 27, 2026Updated 2 months ago
- π¦ Contextually-keyed word vectorsβ1,673Mar 27, 2026Updated 2 months ago
- Fuzzy matching and more functionality for spaCy.β258Jul 6, 2024Updated last year
- βοΈContextual word checker for better suggestions (not actively maintained)β419Jan 31, 2025Updated last year
- π spaCy building blocks and visualizers for Streamlit appsβ857Jul 29, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Self-Supervision for Named Entity Disambiguation at the Tailβ218Jun 14, 2022Updated 3 years ago
- Information extraction from English and German texts based on predicate logicβ144Jun 6, 2023Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ172Nov 7, 2022Updated 3 years ago
- It's a cooler way to store simple linear models.β26Jul 15, 2024Updated last year
- Tools for shrinking fastText models (in gensim format)β185May 3, 2024Updated 2 years ago
- Leveraging BERT and c-TF-IDF to create easily interpretable topics.β7,659May 13, 2026Updated 3 weeks ago
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β481Mar 27, 2026Updated 2 months ago
- A Python library for calculating a large variety of metrics from textβ366May 5, 2026Updated last month
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,995Jun 1, 2026Updated last week
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Dataframe Integration with spaCy.β103Mar 12, 2021Updated 5 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β104Feb 26, 2024Updated 2 years ago
- Generate reports for spaCy models.β29May 27, 2022Updated 4 years ago
- π οΈ Tools for Transformers compression using PyTorch Lightning β‘β85Feb 1, 2026Updated 4 months ago
- Top2Vec learns jointly embedded topic, document and word vectors.β3,106Nov 14, 2024Updated last year
- spaCy pipeline object for negating concepts in textβ282Apr 20, 2026Updated last month
- Super lightweight function registries for your libraryβ182Mar 27, 2026Updated 2 months ago