A block oriented training approach for inference time optimization.
☆34Aug 19, 2024Updated last year
Alternatives and similar repositories for superblock
Users that are interested in superblock are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Apr 13, 2026Updated last month
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆16Nov 22, 2022Updated 3 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Apr 13, 2026Updated last month
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆51Jun 6, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- LLM Inference with Microscaling Format☆34Nov 12, 2024Updated last year
- ☆13Sep 25, 2023Updated 2 years ago
- ☆27Mar 14, 2024Updated 2 years ago
- ☆15Mar 30, 2024Updated 2 years ago
- ☆25Sep 9, 2024Updated last year
- QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning☆181Nov 11, 2025Updated 6 months ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago
- ☆14Mar 8, 2025Updated last year
- ☆16Dec 22, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆28Jun 16, 2025Updated 11 months ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆57Apr 13, 2026Updated last month
- [CVPR 2025] QuartDepth☆18Mar 24, 2025Updated last year
- A Javascript library to interface with the official Instagram Threads API☆22Jun 18, 2024Updated last year
- ☆15Apr 6, 2026Updated last month
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- A python library for converting Pytorch modules into a circle model that is a lightweight and efficient representation in ONE designed fo…☆15Updated this week
- Muon fsdp 2☆58Aug 8, 2025Updated 9 months ago
- ☆16Dec 9, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Use GHC's Memory Allocator from C☆12Feb 22, 2020Updated 6 years ago
- ☆14Jul 14, 2025Updated 10 months ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆16Nov 25, 2025Updated 6 months ago
- ☆15Apr 11, 2024Updated 2 years ago
- Wireshark Plugin for RSocket☆22Aug 6, 2024Updated last year
- ☆14Dec 4, 2020Updated 5 years ago
- Cache evaluation of nix functions☆20Apr 5, 2022Updated 4 years ago
- ☆10Apr 24, 2024Updated 2 years ago
- Pebble REBBLE watchface☆12Mar 3, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent☆32Apr 30, 2026Updated last month
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆101Jan 3, 2025Updated last year
- ☆15Jul 25, 2024Updated last year
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- ☆23Dec 16, 2025Updated 5 months ago
- ☆16Aug 19, 2024Updated last year
- ☆12Jul 30, 2025Updated 10 months ago