Notebooks for training universal 0-shot classifiers on many different tasks
☆140Dec 28, 2024Updated last year
Alternatives and similar repositories for zeroshot-classifier
Users that are interested in zeroshot-classifier are comparing it to the libraries listed below
Sorting:
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆89Aug 25, 2023Updated 2 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆81Feb 10, 2026Updated 3 weeks ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Sep 18, 2025Updated 5 months ago
- Train huggingface models on top of Prodigy annotations☆21Feb 19, 2024Updated 2 years ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated last year
- SpanMarker for Named Entity Recognition☆464Jan 8, 2025Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- zero shot NER fine tuning☆14Mar 17, 2025Updated 11 months ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- Efficient few-shot learning with Sentence Transformers☆2,690Dec 11, 2025Updated 2 months ago
- A library for working with prompt templates locally or on the Hugging Face Hub.☆56Mar 5, 2025Updated last year
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Robust and fast topic models with sentence-transformers.☆94Mar 1, 2026Updated last week
- ☆29Oct 24, 2025Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- Fast Multimodal Semantic Deduplication & Filtering☆892Jan 20, 2026Updated last month
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆159Jul 14, 2025Updated 7 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 5 months ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…☆18Mar 14, 2025Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Python SDK for Galileo's NLP and CV Studio.☆17Updated this week
- Learning from Neighbors: Unsupervised Text Classification☆17Sep 27, 2022Updated 3 years ago
- Tools for merging pretrained large language models.☆19Jun 12, 2024Updated last year
- N-gram keyword extraction using spaCy and pretrained language models☆63Apr 11, 2022Updated 3 years ago
- Fast State-of-the-Art Static Embeddings☆2,007Feb 28, 2026Updated last week
- A game theoretic approach to explain the output of any machine learning model.☆14Feb 28, 2022Updated 4 years ago
- Just another sentiment wrapper.☆18Dec 11, 2021Updated 4 years ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- alternative way to calculating self attention☆18May 25, 2024Updated last year
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 9 months ago
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,114Mar 2, 2026Updated last week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,507Feb 17, 2026Updated 2 weeks ago
- Very minimal (and stateless) agent framework☆44Jan 12, 2025Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)☆22Nov 1, 2023Updated 2 years ago