NVIDIA / logits-processor-zoo
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆332 Updated last month
Alternatives and similar repositories for logits-processor-zoo
Users who are interested in logits-processor-zoo are comparing it to the libraries listed below.
- Manage scalable open LLM inference endpoints in Slurm clusters ☆268 Updated last year
- code for training & evaluating Contextual Document Embedding models ☆196 Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆230 Updated 9 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆284 Updated 5 months ago
- Easily embed, cluster and semantically label text datasets ☆560 Updated last year
- awesome synthetic (text) datasets ☆291 Updated last month
- Let's build better datasets, together! ☆260 Updated 7 months ago
- Late Interaction Models Training & Retrieval ☆521 Updated 3 weeks ago
- ☆129 Updated 4 months ago
- ☆174 Updated last month
- Best practices for distilling large language models. ☆569 Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆142 Updated 3 weeks ago
- A compact LLM pretrained in 9 days by using high quality data ☆320 Updated 3 months ago
- ☆529 Updated 8 months ago
- Official repository for ORPO ☆462 Updated last year
- An Open Source Toolkit For LLM Distillation ☆703 Updated last month
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training ☆67 Updated 6 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆210 Updated 3 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆195 Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task ☆125 Updated last week
- ☆208 Updated 5 months ago
- PyTorch building blocks for the OLMo ecosystem ☆269 Updated this week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆360 Updated 11 months ago
- ☆154 Updated 8 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆242 Updated 8 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆198 Updated last year
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆110 Updated 10 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆164 Updated 4 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆243 Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks ☆133 Updated 7 months ago