A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
☆364Jul 30, 2024Updated last year
Alternatives and similar repositories for nn-Meter
Users that are interested in nn-Meter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆66Aug 5, 2024Updated last year
- To deploy Transformer models in CV to mobile devices.☆18Jan 20, 2022Updated 4 years ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆118Apr 18, 2023Updated 3 years ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,952Dec 14, 2023Updated 2 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,002Sep 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- quantize aware training package for NCNN on pytorch☆68Jul 27, 2021Updated 4 years ago
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆42Aug 14, 2021Updated 4 years ago
- Tengine 管子是用来快速生产 demo 的辅助工具☆11Jul 15, 2021Updated 4 years ago
- Model Quantization Benchmark☆868Apr 20, 2025Updated last year
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆201Apr 27, 2022Updated 4 years ago
- ncnn android benchmark app☆86Aug 10, 2021Updated 4 years ago
- [SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search☆13Nov 3, 2021Updated 4 years ago
- prebuild package for cross compiling riscv☆17Dec 28, 2021Updated 4 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆161Jun 16, 2020Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- (HotMobile'24) Salted Inference: Enhancing Privacy while Maintaining Efficiency of Split Inference in Mobile Computing☆17Jan 22, 2024Updated 2 years ago
- This is a list of awesome edgeAI inference related papers.☆98Dec 21, 2023Updated 2 years ago
- ☆267Oct 30, 2019Updated 6 years ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,447Aug 30, 2024Updated last year
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le…☆47Oct 26, 2023Updated 2 years ago
- code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"☆106Sep 29, 2021Updated 4 years ago
- code for NASViT☆67Apr 25, 2022Updated 4 years ago
- ☆52May 27, 2026Updated last month
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆959Apr 11, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- NASLib is a Neural Architecture Search (NAS) library for facilitating NAS research for the community by providing interfaces to several …☆591Nov 11, 2024Updated last year
- ☆10May 16, 2021Updated 5 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,647Updated this week
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆201Feb 18, 2021Updated 5 years ago
- Mobile vision models and code☆921Jun 17, 2026Updated last week
- ☆25Aug 27, 2021Updated 4 years ago
- ☆23Dec 8, 2022Updated 3 years ago
- AutoML tools chain☆848Feb 15, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"☆12Aug 18, 2021Updated 4 years ago
- (CVPR 2021, Oral) Dynamic Slimmable Network☆231Dec 31, 2021Updated 4 years ago
- ONNX Optimizer☆819Jun 12, 2026Updated 2 weeks ago
- Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours☆394Dec 14, 2020Updated 5 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing☆338Jul 14, 2024Updated last year
- NART = NART is not A RunTime, a deep learning inference framework.☆37Mar 2, 2023Updated 3 years ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆482Oct 23, 2024Updated last year