A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆382 · Jul 8, 2025 · Updated 7 months ago
Alternatives and similar repositories for logits-processor-zoo
Users who are interested in logits-processor-zoo are comparing it to the libraries listed below.
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆147 · Aug 9, 2024 · Updated last year
- 1st Place Solution for Eedi - Mining Misconceptions in Mathematics Kaggle Competition ☆55 · Dec 27, 2024 · Updated last year
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- LLM KV cache compression made easy ☆936 · Feb 23, 2026 · Updated last week
- Official repository for ORPO ☆472 · May 31, 2024 · Updated last year
- ☆18 · May 22, 2024 · Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,314 · Feb 20, 2026 · Updated 2 weeks ago
- Load any clip model with a standardized interface ☆22 · Oct 20, 2025 · Updated 4 months ago
- ☆21 · Oct 14, 2024 · Updated last year
- Efficient Triton Kernels for LLM Training ☆6,189 · Updated this week
- Super-fast Structured Outputs ☆706 · Updated this week
- Tools for merging pretrained large language models. ☆6,826 · Updated this week
- Easily embed, cluster and semantically label text datasets ☆596 · Mar 28, 2024 · Updated last year
- Applied AI experiments and examples for PyTorch ☆319 · Aug 22, 2025 · Updated 6 months ago
- GPU-accelerated algorithm for subsampling datasets while preserving diversity ☆27 · Jan 12, 2024 · Updated 2 years ago
- ☆55 · Nov 22, 2024 · Updated last year
- Deploy KoGPT with Triton Inference Server ☆14 · Nov 18, 2022 · Updated 3 years ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆2,817 · Updated this week
- ☆92 · Jul 4, 2025 · Updated 8 months ago
- Minimalistic large language model 3D-parallelism training ☆2,579 · Feb 19, 2026 · Updated 2 weeks ago
- Composable inference algorithms with LLMs and programmable logic ☆69 · Dec 4, 2024 · Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,915 · Updated this week
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆37 · Oct 9, 2025 · Updated 4 months ago
- Pre-train Static Word Embeddings ☆93 · Sep 9, 2025 · Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Feb 27, 2025 · Updated last year
- A pytorch quantization backend for optimum ☆1,030 · Nov 21, 2025 · Updated 3 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,602 · Dec 20, 2025 · Updated 2 months ago
- Bringing BERT into modernity via both architecture changes and scaling ☆1,632 · Updated this week
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica… ☆15 · Sep 4, 2025 · Updated 6 months ago
- ☆14 · May 3, 2022 · Updated 3 years ago
- ☆12 · Jul 17, 2024 · Updated last year
- ☆50 · Mar 14, 2024 · Updated last year
- PyTorch native post-training library ☆5,691 · Feb 27, 2026 · Updated last week
- PyTorch native quantization and sparsity for training and inference ☆2,707 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,108 · Feb 23, 2026 · Updated last week
- Fast Multimodal Semantic Deduplication & Filtering ☆892 · Jan 20, 2026 · Updated last month
- Efficient few-shot learning with Sentence Transformers ☆2,688 · Dec 11, 2025 · Updated 2 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,688 · Feb 5, 2026 · Updated last month