OAID / AutoKernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
☆777Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for AutoKernel
- ☆247Updated last year
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,653Updated 2 months ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 3 years ago
- Tengine Convert Tool supports converting multi framworks' models into tmfile that suitable for Tengine-Lite AI framework.☆94Updated 3 years ago
- TVM Documentation in Chinese Simplified / TVM 中文文档☆957Updated this week
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆493Updated 3 weeks ago
- 🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.☆750Updated last year
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆393Updated last year
- TensorRT Plugin Autogen Tool☆367Updated last year
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆473Updated 3 weeks ago
- row-major matmul optimization☆591Updated last year
- Model Quantization Benchmark☆765Updated 5 months ago
- VeriSilicon Tensor Interface Module☆224Updated 3 months ago
- heterogeneity-aware-lowering-and-optimization☆253Updated 10 months ago
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆171Updated last year
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆226Updated last month
- arm-neon☆88Updated 3 months ago
- Compiler Infrastructure for Neural Networks☆143Updated last year
- Dive into Deep Learning Compiler☆643Updated 2 years ago
- Everything in Torch Fx☆341Updated 5 months ago
- NART = NART is not A RunTime, a deep learning inference framework.☆38Updated last year
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .☆103Updated this week
- A primitive library for neural network☆1,295Updated 2 weeks ago
- caffe model convert to onnx model☆175Updated last year
- A simple network quantization demo using pytorch from scratch.☆511Updated last year
- ☆93Updated 3 years ago
- A CPU tool for benchmarking the peak of floating points☆503Updated last month
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆918Updated 3 months ago
- DeepLearning Framework Performance Profiling Toolkit☆277Updated 2 years ago
- ONNX2Pytorch☆159Updated 3 years ago