apple / ml-upscale
Export utility for unconstrained channel pruned models
☆71Updated last year
Alternatives and similar repositories for ml-upscale:
Users that are interested in ml-upscale are comparing it to the libraries listed below
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- ☆77Updated last year
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆20Updated last year
- ☆19Updated 3 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆65Updated 6 months ago
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆86Updated last year
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆79Updated last year
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Updated last year
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆55Updated last year
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆30Updated 5 months ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆47Updated last year
- Utility to test the performance of CoreML models.☆68Updated 4 years ago
- ☆66Updated 2 years ago
- ☆31Updated 7 months ago
- ☆74Updated 2 years ago
- ☆34Updated last year
- Dynamic Neural Architecture Search Toolkit☆29Updated 2 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆35Updated 11 months ago
- ☆197Updated 3 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆33Updated 3 years ago
- ☆220Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆41Updated 4 months ago
- ☆17Updated 2 years ago
- ☆44Updated 3 years ago
- In progress.☆63Updated 10 months ago
- ☆134Updated last year
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆32Updated last year
- Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer☆102Updated last year