TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
☆866Dec 24, 2025Updated 2 months ago
Alternatives and similar repositories for TinyNeuralNetwork
Users who are interested in TinyNeuralNetwork are comparing it to the libraries listed below.
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,785Mar 28, 2024Updated last year
- Model Quantization Benchmark☆858Apr 20, 2025Updated 10 months ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,563Updated this week
- micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), High-Bit (>2b) (DoReFa/Quantiz…☆2,271May 6, 2025Updated 9 months ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,258Sep 7, 2025Updated 5 months ago
- [ACCV2022 (Oral)] Efficient Hardware-aware Neural Architecture Search for Image Super-resolution on Mobile Devices☆18Oct 5, 2022Updated 3 years ago
- A model compression and acceleration toolbox based on pytorch.☆333Jan 12, 2024Updated 2 years ago
- Simplify your onnx model☆4,297Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,127Updated this week
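- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability.☆486Oct 23, 2024Updated last year
- A toolset for automated structure analysis and modification of PyTorch models, including a library of model compression algorithms that analyze model structure automatically.☆256Apr 19, 2023Updated 2 years ago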
- A primitive library for neural network☆1,366Nov 24, 2024Updated last year
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,662Jun 11, 2024Updated last year
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,940Dec 14, 2023Updated 2 years ago
- A simple framework for image restoration that includes ECBSR, ELAN and other SOTAs.☆50Nov 13, 2022Updated 3 years ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,613Nov 19, 2025Updated 3 months ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆363Jul 30, 2024Updated last year
- NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone…☆6,163Aug 8, 2024Updated last year
- Offline Quantization Tools for Deploy.☆142Dec 28, 2023Updated 2 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆924Nov 27, 2024Updated last year
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆956Apr 11, 2025Updated 10 months ago
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,374Updated this week
- RepVGG: Making VGG-style ConvNets Great Again☆3,458Feb 10, 2023Updated 3 years ago
- (ECCV'2020 Oral) EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning☆309Dec 8, 2022Updated 3 years ago
- Group Fisher Pruning for Practical Network Compression (ICML2021)☆161May 24, 2023Updated 2 years ago
- A library for high performance deep learning inference on NVIDIA GPUs.☆555Jan 29, 2022Updated 4 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆654Mar 29, 2024Updated last year
- ☆46Jun 6, 2021Updated 4 years ago
- United Perception☆436Dec 5, 2022Updated 3 years ago
- edge-SR: Super-Resolution For The Masses☆62Jan 1, 2022Updated 4 years ago
- A simple network quantization demo using pytorch from scratch.☆542Jun 18, 2023Updated 2 years ago
- Support PyTorch model conversion with LiteRT.☆944Updated this week
- PyTorch Neural Network eXchange☆680Jan 30, 2026Updated last month
- An Easy-to-Use and High-Performance AI Deployment Framework☆1,743Feb 23, 2026Updated last week
- OpenMMLab Model Deployment Framework☆3,100Sep 30, 2024Updated last year
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is …☆4,619May 9, 2025Updated 9 months ago
- Brevitas: neural network quantization in PyTorch☆1,488Updated this week
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM …☆14,276Updated this week
- A Toolkit to Help Optimize Large Onnx Model☆165Oct 26, 2025Updated 4 months ago