Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
☆1,161Apr 16, 2026Updated this week
Alternatives and similar repositories for pruna
Users who are interested in pruna are comparing it to the libraries listed below.
- This is a ComfyUI node that integrates pruna☆66Sep 8, 2025Updated 7 months ago
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆100May 30, 2025Updated 10 months ago
- ☆33Oct 3, 2023Updated 2 years ago
- Code for reproducing the paper "Space-Time Continuous PDE Forecasting using Equivariant Neural Fields" (https://arxiv.org/abs/2406.06660)…☆14Nov 18, 2025Updated 5 months ago
- A simple Fast API Backend for Ironclad/rivet☆26Jan 9, 2024Updated 2 years ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆70Dec 4, 2025Updated 4 months ago
- MoD Control Tile Upscaler for SDXL Pipeline☆61Mar 8, 2025Updated last year
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models☆17May 21, 2025Updated 10 months ago
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 9 months ago
- PaiNN in jax☆11Jan 14, 2025Updated last year
- This Denoising Force Field (DFF) codebase provides a Pytorch framework for the method presented in Two for one: Diffusion models and forc…☆67May 22, 2024Updated last year
- Fast State-of-the-Art Static Embeddings☆2,024Apr 10, 2026Updated last week
- NLP with Rust for Python 🦀🐍☆72May 13, 2025Updated 11 months ago
- ☆23Jun 5, 2025Updated 10 months ago
- Synthetic Text Dataset Generation for LLM projects☆58Updated this week
- ☆12Jan 4, 2024Updated 2 years ago
- Plug-and-play document AI with zero-shot models.☆124Feb 16, 2026Updated 2 months ago
- Crispy reranking models by Mixedbread☆51Sep 17, 2025Updated 7 months ago
- Drift detection module for machine learning pipelines.☆24Jun 21, 2023Updated 2 years ago
- Interpretable ML for TabPFN☆51Jul 13, 2025Updated 9 months ago
- just a bunch of useful embeddings for scikit-learn pipelines☆525Feb 12, 2026Updated 2 months ago
- ModernBERT model optimized for Apple Neural Engine.☆31Jan 10, 2025Updated last year
- PyTorch native quantization and sparsity for training and inference☆2,786Updated this week
- Cool extensions built for the nbdev framework☆14Jun 6, 2023Updated 2 years ago
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,384Apr 11, 2026Updated last week
- ☆171Updated this week
- A minimalistic, hackable code base to finetune Wan video generation model☆50Feb 22, 2026Updated last month
- ☆12Dec 30, 2020Updated 5 years ago
- C++ inference engine for running GLiNER (Generalist and Lightweight Named Entity Recognition) models☆46Dec 11, 2024Updated last year
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 11 months ago
- Guide: from fragile multi-agent app to prod ready with orra - code and resources.☆14Mar 24, 2025Updated last year
- Examples for beam.cloud☆25Aug 26, 2025Updated 7 months ago
- Notebooks using the Neural Magic libraries 📓☆39Jul 24, 2024Updated last year
- GLiNER inference in JavaScript☆24Mar 2, 2025Updated last year
- Annotated implementations of equivariant (graph) neural networks in Jax: EGNN, SEGNN, NequIP.☆43Mar 1, 2025Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,983Apr 10, 2026Updated last week
- ☆15Jan 12, 2025Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆85Feb 10, 2026Updated 2 months ago
- Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild☆3,403Updated this week