apple / ml-upscale
Export utility for unconstrained channel pruned models
☆66Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ml-upscale
- ☆69Updated 10 months ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆69Updated 2 years ago
- A light-weight implementation of ICCV2023 paper "Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset Rei…☆77Updated last year
- Utility to test the performance of CoreML models.☆67Updated 4 years ago
- ☆56Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆29Updated 2 months ago
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning☆15Updated last year
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆53Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆13Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆41Updated last year
- ☆18Updated 3 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆55Updated 4 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆35Updated 7 months ago
- PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation)☆75Updated last year
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated last year
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆34Updated last month
- ☆214Updated 2 years ago
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆19Updated last year
- A codebase & model zoo for pretrained backbone based on MegEngine.☆32Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 2 years ago
- FOX-NAS: Fast, On-device and Explainable NeuralArchitecture Search☆10Updated 3 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆66Updated last year
- ☆67Updated 2 years ago
- ☆34Updated last year
- ☆20Updated 2 years ago
- ☆192Updated 3 years ago
- ☆21Updated 3 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- ☆24Updated this week
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆68Updated 3 years ago