mlcommons / mobile_open
MLPerf Mobile benchmarks
☆10Updated last week
Alternatives and similar repositories for mobile_open
Users that are interested in mobile_open are comparing it to the libraries listed below
Sorting:
- Mobile App Open☆54Updated last week
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆52Updated 2 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆34Updated 3 years ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Updated 2 years ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆40Updated 5 months ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Updated last year
- ☆146Updated 2 years ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆37Updated last week
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Updated 10 months ago
- ☆34Updated 2 years ago
- ☆13Updated last week
- Fast NPU-aware Neural Architecture Search☆22Updated 3 years ago
- ☆17Updated 3 years ago
- ☆76Updated 2 years ago
- [ECCV 2022] EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers☆13Updated 2 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆23Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Updated last year
- Official MegEngine Implementation of Real-Time Intermediate Flow Estimation for Video Frame Interpolation☆29Updated 2 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆32Updated 2 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 5 years ago
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution☆17Updated last year
- Algorithm-hardware Co-design for Deformable Convolution☆24Updated 4 years ago
- Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier☆56Updated last year
- Code for ICML 2021 submission☆34Updated 4 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi…☆25Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆25Updated 3 years ago