ML model optimization product to accelerate inference.
☆325Jun 2, 2025Updated last year
Alternatives and similar repositories for sparsify
Users that are interested in sparsify are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes☆388Jun 2, 2025Updated last year
- Top-level directory for documentation and general content☆120Jun 2, 2025Updated last year
- Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models☆2,143Jun 2, 2025Updated last year
- Sparsity-aware deep learning inference runtime for CPUs☆3,161Jun 2, 2025Updated last year
- ☆16Sep 27, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ONNX model visualizer☆88Jun 28, 2023Updated 2 years ago
- ☆13Jun 10, 2026Updated last week
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15May 3, 2021Updated 5 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Aug 17, 2022Updated 3 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Run zero-shot prediction models on your data☆37Dec 19, 2024Updated last year
- A research library for pytorch-based neural network pruning, compression, and more.☆163Nov 28, 2022Updated 3 years ago
- ☆25Sep 19, 2025Updated 9 months ago
- Repo for the Naive Bayesian Meetup Group☆11Nov 12, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- Tutorial on how to convert machine learned models into ONNX☆14Mar 11, 2023Updated 3 years ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,655Jun 12, 2026Updated last week
- Implementation of a simple BPE tokenizer, but in Nim☆22Jul 2, 2023Updated 2 years ago
- Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains☆1,724Oct 8, 2023Updated 2 years ago
- ☆12Sep 22, 2024Updated last year
- Simple tool to change the INPUT and OUTPUT shape of ONNX.☆15Apr 1, 2025Updated last year
- PyTorch Lightning + Hydra. + Timm: A very user-friendly template for rapid and reproducible MLOps with best practices. ⚡🔥⚡☆17Mar 3, 2023Updated 3 years ago
- My journey during 10 weeks of building FiftyOne plugins☆22Nov 12, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,642Updated this week
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆28Feb 21, 2023Updated 3 years ago
- Efficient GPU kernels for mixed-precision Vision Transformers in Triton☆17Sep 18, 2025Updated 9 months ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- Refine high-quality datasets and visual AI models☆10,777Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,720Apr 9, 2026Updated 2 months ago
- A model compression and acceleration toolbox based on pytorch.☆331Jan 12, 2024Updated 2 years ago
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆885Aug 20, 2024Updated last year
- Ranking of fine-tuned HF models as base models.☆36Sep 17, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Count the MACs / FLOPs of PyTorch models☆636Mar 11, 2026Updated 3 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,688Oct 23, 2024Updated last year
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago
- Simple tool to combine(merge) onnx models. Simple Network Combine Tool for ONNX.☆18Oct 8, 2025Updated 8 months ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆53Mar 8, 2021Updated 5 years ago
- Sharable Grakn knowledge graphs☆13Dec 28, 2022Updated 3 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Oct 18, 2023Updated 2 years ago