A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆389 · Jul 8, 2025 · Updated 10 months ago
Alternatives and similar repositories for logits-processor-zoo
Users that are interested in logits-processor-zoo are comparing it to the libraries listed below.
- 1st Place Solution for Eedi - Mining Misconceptions in Mathematics Kaggle Competition ☆58 · Dec 27, 2024 · Updated last year
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- GPU-accelerated algorithm for subsampling datasets while preserving diversity ☆27 · Jan 12, 2024 · Updated 2 years ago
- LLM KV cache compression made easy ☆1,066 · Updated this week
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆150 · Aug 9, 2024 · Updated last year
- ☆56 · Nov 22, 2024 · Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆51 · Aug 7, 2025 · Updated 9 months ago
- Creating diff that supports wildcard produced by LLMs ☆16 · Sep 18, 2024 · Updated last year
- Official repository for ORPO ☆483 · May 31, 2024 · Updated last year
- Tools for merging pretrained large language models. ☆7,052 · Mar 15, 2026 · Updated last month
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Applied AI experiments and examples for PyTorch ☆320 · Aug 22, 2025 · Updated 8 months ago
- ☆77 · Apr 20, 2026 · Updated 2 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,396 · Apr 17, 2026 · Updated 3 weeks ago
- Load any clip model with a standardized interface ☆22 · Oct 20, 2025 · Updated 6 months ago
- Efficient Triton Kernels for LLM Training ☆6,331 · Apr 30, 2026 · Updated last week
- Composable inference algorithms with LLMs and programmable logic ☆70 · Dec 4, 2024 · Updated last year
- ☆50 · Mar 14, 2024 · Updated 2 years ago
- Super-fast Structured Outputs ☆752 · Apr 29, 2026 · Updated last week
- Minimalistic large language model 3D-parallelism training ☆2,678 · Apr 7, 2026 · Updated last month
- ☆98 · Jul 4, 2025 · Updated 10 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,728 · Apr 17, 2026 · Updated 3 weeks ago
- ☆19 · Jul 26, 2024 · Updated last year
- Code for the CHAMPS Predicting Molecular Properties Kaggle competition ☆52 · Aug 31, 2019 · Updated 6 years ago
- Easily embed, cluster and semantically label text datasets ☆603 · Mar 28, 2024 · Updated 2 years ago
- Enforce the output format (JSON Schema, Regex etc) of a language model ☆2,011 · Apr 4, 2026 · Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,033 · Apr 20, 2026 · Updated 2 weeks ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 11 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated 2 years ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead. ☆234 · Jun 7, 2025 · Updated 11 months ago
- ☆17 · Apr 3, 2024 · Updated 2 years ago
- PyTorch native post-training library ☆5,750 · May 1, 2026 · Updated last week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,782 · Mar 24, 2026 · Updated last month
- Training code for Sparse Autoencoders on Embedding models ☆39 · Apr 25, 2026 · Updated 2 weeks ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆3,190 · Updated this week
- A simple python wrapper for using the Caddy API ☆26 · Apr 28, 2026 · Updated last week
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,209 · Apr 27, 2026 · Updated last week
- Bringing BERT into modernity via both architecture changes and scaling ☆1,668 · Mar 1, 2026 · Updated 2 months ago