apple / ml-quant
Research publication code for "Least Squares Binary Quantization of Neural Networks"
☆78Updated last year
Related projects: ⓘ
- ☆56Updated 2 years ago
- Export utility for unconstrained channel pruned models☆66Updated last year
- A collection of metrics to profile a single deep learning model or compare two different deep learning models☆24Updated 10 months ago
- Customized matrix multiplication kernels☆53Updated 2 years ago
- Butterfly matrix multiplication in PyTorch☆160Updated 11 months ago
- ☆18Updated 2 years ago
- PyProf2: PyTorch Profiling tool☆83Updated 4 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆26Updated 2 years ago
- ☆22Updated 2 years ago
- Reference implementations of popular Binarized Neural Networks☆104Updated last week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆173Updated 3 months ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆38Updated 5 years ago
- All about acceleration and compression of Deep Neural Networks☆33Updated 4 years ago
- ☆74Updated 2 years ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆22Updated 2 years ago
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆19Updated last year
- Test data for DALI project☆39Updated 3 weeks ago
- Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules☆40Updated 2 years ago
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion☆40Updated 3 years ago
- Dynamic Neural Architecture Search Toolkit☆28Updated 3 months ago
- Repository containing pruned models and related information☆35Updated 3 years ago
- Using ideas from product quantization for state-of-the-art neural network compression.☆145Updated 3 years ago
- Arch-Net: Model Distillation for Architecture Agnostic Model Deployment☆22Updated 2 years ago
- Code for the paper "Training Binary Neural Networks with Bayesian Learning Rule☆37Updated 2 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆45Updated 3 years ago
- 3rd place solution for NeurIPS 2019 MicroNet challenge☆35Updated 4 years ago
- Implemented here a Binary Neural Network (BNN) achieving nearly state-of-art results but recorded a significant reduction in memory usage…☆68Updated 3 years ago
- ☆26Updated last year
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆229Updated last year
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 5 years ago