Open source no-code system for text annotation and building of text classifiers
☆271 · May 26, 2025 · Updated 9 months ago
Alternatives and similar repositories for label-sleuth
Users that are interested in label-sleuth are comparing it to the libraries listed below
- A package dedicated for running benchmark agreement testing ☆17 · Sep 18, 2025 · Updated 5 months ago
- Semantically Structured Sentence Embeddings ☆71 · Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines ☆522 · Feb 12, 2026 · Updated 3 weeks ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆739 · Sep 18, 2025 · Updated 5 months ago
- ☆76 · Oct 25, 2021 · Updated 4 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,884 · Mar 2, 2026 · Updated last week
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022 ☆28 · May 18, 2022 · Updated 3 years ago
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 · Sep 2, 2024 · Updated last year
- Active Learning for Text Classification in Python ☆639 · Feb 1, 2026 · Updated last month
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 · Aug 5, 2023 · Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 · Feb 7, 2023 · Updated 3 years ago
- Zero and Few shot named entity & relationships recognition ☆402 · Sep 17, 2025 · Updated 5 months ago
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact… ☆1,470 · Dec 9, 2024 · Updated last year
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 · May 20, 2022 · Updated 3 years ago
- REMERGE - Multi-Word Expression discovery algorithm ☆14 · Feb 21, 2026 · Updated 2 weeks ago
- ☆18 · Feb 28, 2022 · Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 · Mar 21, 2021 · Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Jul 25, 2024 · Updated last year
- A Simple Bulk Labelling Tool ☆599 · Jul 29, 2025 · Updated 7 months ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost ☆42 · Nov 15, 2023 · Updated 2 years ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆211 · Feb 16, 2026 · Updated 3 weeks ago
- A personal knowledge base that I can dump information to and help me learn ☆25 · May 26, 2025 · Updated 9 months ago
- ☆19 · Nov 4, 2022 · Updated 3 years ago
- ☆75 · Jul 2, 2021 · Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆244 · Jun 19, 2023 · Updated 2 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 · Nov 29, 2021 · Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ☆11 · May 19, 2022 · Updated 3 years ago
- ☆11 · Jun 18, 2023 · Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2… ☆12 · Feb 19, 2023 · Updated 3 years ago
- ☆37 · Nov 22, 2025 · Updated 3 months ago
- ☆44 · Mar 3, 2023 · Updated 3 years ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking! ☆475 · Feb 6, 2023 · Updated 3 years ago
- A few-shot learning method based on siamese networks. ☆28 · Feb 20, 2023 · Updated 3 years ago
- FastFormers - highly efficient transformer models for NLU ☆709 · Mar 21, 2025 · Updated 11 months ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. ☆1,413 · Aug 30, 2023 · Updated 2 years ago
- PYthon Automated Term Extraction ☆318 · Feb 8, 2023 · Updated 3 years ago
- Doubt your data, find bad labels. ☆517 · Jul 15, 2024 · Updated last year