Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
☆1,113 · Feb 27, 2026 · Updated last week
Alternatives and similar repositories for pruna
Users who are interested in pruna are comparing it to the libraries listed below.
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing". ☆100 · May 30, 2025 · Updated 9 months ago
- This is a ComfyUI node that integrates pruna ☆66 · Sep 8, 2025 · Updated 5 months ago
- MoD Control Tile Upscaler for SDXL Pipeline ☆61 · Mar 8, 2025 · Updated 11 months ago
- Notebooks using the Neural Magic libraries 📓 ☆39 · Jul 24, 2024 · Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including… ☆68 · Dec 4, 2025 · Updated 3 months ago
- Fast State-of-the-Art Static Embeddings ☆2,007 · Updated this week
- Courses on building, compressing, evaluating, and deploying efficient AI models. ☆68 · Nov 10, 2025 · Updated 3 months ago
- ☆102 · Jan 19, 2026 · Updated last month
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models ☆16 · May 21, 2025 · Updated 9 months ago
- NLP with Rust for Python 🦀🐍 ☆72 · May 13, 2025 · Updated 9 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic… ☆60 · Feb 24, 2025 · Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆63 · Feb 6, 2025 · Updated last year
- [WIP] A 🔥 interface for running code in the cloud ☆86 · Feb 24, 2023 · Updated 3 years ago
- PyTorch native quantization and sparsity for training and inference ☆2,707 · Updated this week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗 ☆1,277 · Feb 26, 2026 · Updated last week
- Synthetic Text Dataset Generation for LLM projects ☆56 · Feb 27, 2026 · Updated last week
- Python library to use Pleias-RAG models ☆68 · May 1, 2025 · Updated 10 months ago
- GLiNER inference in JavaScript ☆22 · Mar 2, 2025 · Updated last year
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference ☆26 · Jan 27, 2026 · Updated last month
- YesBut - Multimodal Satire Comprehension Dataset ☆18 · Oct 23, 2024 · Updated last year
- Just a bunch of useful embeddings for scikit-learn pipelines ☆522 · Feb 12, 2026 · Updated 3 weeks ago
- A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks. ☆47 · Aug 13, 2025 · Updated 6 months ago
- WIP PyTorch code for stably training single-step, mode-dropping, deterministic autoencoders ☆44 · Jul 26, 2025 · Updated 7 months ago
- ☆167 · Feb 26, 2026 · Updated last week
- ☆57 · Jul 6, 2025 · Updated 8 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,915 · Updated this week
- MBASE, an LLM SDK in C++ ☆56 · Jul 9, 2025 · Updated 7 months ago
- Crispy reranking models by Mixedbread ☆47 · Sep 17, 2025 · Updated 5 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆206 · Aug 31, 2024 · Updated last year
- Official implementation of Half-Quadratic Quantization (HQQ) ☆915 · Feb 26, 2026 · Updated last week
- Plug-and-play document AI with zero-shot models. ☆124 · Feb 16, 2026 · Updated 2 weeks ago
- This Denoising Force Field (DFF) codebase provides a PyTorch framework for the method presented in Two for one: Diffusion models and forc… ☆67 · May 22, 2024 · Updated last year
- A web-based tool that helps prepare source code for Large Language Models (LLMs) by combining multiple files into a single text file. Per… ☆42 · Jan 11, 2025 · Updated last year
- Official inference library for pre-processing of Mistral models ☆861 · Feb 27, 2026 · Updated last week
- ☆175 · Nov 8, 2025 · Updated 3 months ago
- Making Flux go brrr on GPUs. ☆163 · Jan 5, 2026 · Updated 2 months ago
- A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation ☆835 · Sep 17, 2025 · Updated 5 months ago
- Drift detection module for machine learning pipelines. ☆24 · Jun 21, 2023 · Updated 2 years ago
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year