alibaba / TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
☆815Updated last week
Alternatives and similar repositories for TinyNeuralNetwork:
Users that are interested in TinyNeuralNetwork are comparing it to the libraries listed below
- A parser, editor and profiler tool for ONNX models.☆421Updated 2 months ago
- Model Quantization Benchmark☆793Updated 2 months ago
- PyTorch Neural Network eXchange☆563Updated this week
- A simple network quantization demo using pytorch from scratch.☆521Updated last year
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,654Updated 11 months ago
- ☆314Updated last year
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆940Updated 7 months ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,446Updated 3 weeks ago
- A primitive library for neural network☆1,324Updated 3 months ago
- ONNX Optimizer☆681Updated last week
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆346Updated 7 months ago
- Deploy your model with TensorRT quickly.☆765Updated last year
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆394Updated 2 years ago
- 😎 A Collection of Awesome NCNN-based Projects☆732Updated 2 years ago
- nndeploy is an end-to-end model inference and deployment framework. It aims to provide users with a powerful, easy-to-use, high-performan…☆710Updated this week
- ☆1,017Updated last year
- Simple samples for TensorRT programming☆1,586Updated last week
- On-Device Training Under 256KB Memory [NeurIPS'22]☆462Updated 11 months ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆537Updated 11 months ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆250Updated last year
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆327Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆300Updated 6 months ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆480Updated 4 months ago
- Offline Quantization Tools for Deploy.☆125Updated last year
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆767Updated this week
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆500Updated 4 months ago
- TensorRT Plugin Autogen Tool☆369Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,249Updated last week
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,902Updated last year
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago