sony / mct_quantizers
☆17Updated last month
Related projects ⓘ
Alternatives and complementary repositories for mct_quantizers
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆17Updated this week
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 2 years ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆22Updated 3 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆324Updated this week
- ☆18Updated 3 years ago
- Neural Architecture Search for Neural Network Libraries☆56Updated 9 months ago
- Dynamic Neural Architecture Search Toolkit☆29Updated 5 months ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆24Updated last month
- Open Source Projects from Pallas Lab☆19Updated 3 years ago
- A Python package with command-line utilities and scripts to aid the development of machine learning models for Silicon Lab's embedded pl…☆51Updated 4 months ago
- [ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition☆30Updated 3 years ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆20Updated 2 years ago
- ☆201Updated last year
- Deep Compression for PyTorch Model Deployment on Microcontrollers☆17Updated 3 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆31Updated last year
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆19Updated 2 years ago
- ACL 2023☆38Updated last year
- Improving Recording Device Generalization using Impulse Response Augmentation☆10Updated last year
- Binarize convolutional neural networks using pytorch☆134Updated 2 years ago
- Model compression for ONNX☆74Updated last month
- Repository for CPU Kernel Generation for LLM Inference☆24Updated last year
- Reference implementations of popular Binarized Neural Networks☆104Updated last week
- TFLite model analyzer & memory optimizer☆120Updated 9 months ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆43Updated last year
- Bring your AI to the Edge - Starting from building the ML model to the selection of the target platform to the optimization and implement…☆52Updated 2 years ago
- ☆20Updated 2 years ago
- Awesome Quantization Paper lists with Codes☆12Updated 3 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆126Updated last week