mlcommons / mobile_open
MLPerf Mobile benchmarks
☆15Updated last week
Alternatives and similar repositories for mobile_open
Users who are interested in mobile_open are comparing it to the repositories listed below.
- Mobile App Open☆67Updated this week
- ☆170Updated 2 years ago
- ☆208Updated 4 years ago
- [MobiSys 2020] Fast and Scalable In-memory Deep Multitask Learning via Neural Weight Virtualization☆15Updated 5 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆58Updated 3 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆113Updated last week
- ☆16Updated 2 months ago
- ☆244Updated 3 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch☆37Updated 4 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Updated 4 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆27Updated 2 years ago
- Awesome Quantization Paper lists with Codes☆10Updated 4 years ago
- Study Group of Deep Learning Compiler☆167Updated 3 years ago
- CUDA Templates for Linear Algebra Subroutines☆101Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆73Updated 3 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆64Updated last year
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆93Updated 3 years ago
- ☆78Updated 3 years ago
- Low Precision Arithmetic Simulation in PyTorch☆291Updated last year
- Inference of quantization aware trained networks using TensorRT☆83Updated 3 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Updated 2 years ago
- Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer☆114Updated 2 years ago
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization☆265Updated 3 years ago
- Reproducing Quantization paper PACT☆65Updated 3 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆142Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆199Updated 3 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453Updated 2 years ago
- Reference implementations of popular Binarized Neural Networks☆109Updated this week
- Experimental deep learning framework written in Rust☆15Updated 3 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆175Updated this week