A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆391 · Jul 8, 2025 · Updated 11 months ago
Alternatives and similar repositories for logits-processor-zoo
Users that are interested in logits-processor-zoo are comparing it to the libraries listed below.
- 1st Place Solution for Eedi - Mining Misconceptions in Mathematics Kaggle Competition ☆58 · Dec 27, 2024 · Updated last year
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- GPU-accelerated algorithm for subsampling datasets while preserving diversity ☆27 · Jan 12, 2024 · Updated 2 years ago
- LLM KV cache compression made easy ☆1,112 · Jun 10, 2026 · Updated last week
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆151 · Aug 9, 2024 · Updated last year
- ☆56 · Nov 22, 2024 · Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆50 · Aug 7, 2025 · Updated 10 months ago
- Creating diff that supports wildcard produced by LLMs ☆16 · Sep 18, 2024 · Updated last year
- Official repository for ORPO ☆481 · May 31, 2024 · Updated 2 years ago
- Tools for merging pretrained large language models. ☆7,154 · Updated this week
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Super-fast Structured Outputs ☆793 · Updated this week
- Applied AI experiments and examples for PyTorch ☆323 · Aug 22, 2025 · Updated 9 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,447 · Jun 9, 2026 · Updated last week
- Load any clip model with a standardized interface ☆22 · Oct 20, 2025 · Updated 7 months ago
- This repository contains the implementation of various techniques to segment brain tumors from MRI images. ☆10 · Aug 17, 2023 · Updated 2 years ago
- Efficient Triton Kernels for LLM Training ☆6,444 · Updated this week
- ☆50 · Mar 14, 2024 · Updated 2 years ago
- Minimalistic large language model 3D-parallelism training ☆2,715 · May 26, 2026 · Updated 3 weeks ago
- Efficient few-shot learning with Sentence Transformers ☆2,746 · May 26, 2026 · Updated 3 weeks ago
- ☆101 · Jul 4, 2025 · Updated 11 months ago
- ☆19 · Jul 26, 2024 · Updated last year
- Easily embed, cluster and semantically label text datasets ☆608 · Mar 28, 2024 · Updated 2 years ago
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆2,020 · Apr 4, 2026 · Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,091 · May 26, 2026 · Updated 3 weeks ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆19 · May 23, 2025 · Updated last year
- Formatron empowers everyone to control the format of language models' output with minimal overhead. ☆237 · Jun 7, 2025 · Updated last year
- ☆17 · Apr 3, 2024 · Updated 2 years ago
- PyTorch native post-training library ☆5,772 · Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,833 · Mar 24, 2026 · Updated 2 months ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Updated this week
- ☆12 · Jul 17, 2024 · Updated last year
- A simple python wrapper for using the Caddy API ☆27 · May 20, 2026 · Updated 3 weeks ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆3,418 · Updated this week
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,251 · Jun 8, 2026 · Updated last week
- Bringing BERT into modernity via both architecture changes and scaling ☆1,691 · Mar 1, 2026 · Updated 3 months ago
- ☆22 · Oct 14, 2024 · Updated last year