OAID / AutoKernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
☆736Updated 2 years ago
Alternatives and similar repositories for AutoKernel:
Users that are interested in AutoKernel are comparing it to the libraries listed below
- Tengine Convert Tool supports converting multi framworks' models into tmfile that suitable for Tengine-Lite AI framework.☆93Updated 3 years ago
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,458Updated last month
- ☆246Updated last year
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆501Updated 5 months ago
- TVM Documentation in Chinese Simplified / TVM 中文文档☆1,018Updated this week
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆482Updated 5 months ago
- 🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.☆744Updated last year
- A primitive library for neural network☆1,331Updated 4 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆397Updated 2 years ago
- A library for high performance deep learning inference on NVIDIA GPUs.☆552Updated 3 years ago
- TensorRT Plugin Autogen Tool☆369Updated 2 years ago
- row-major matmul optimization☆622Updated last year
- ☆96Updated 3 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆940Updated last week
- caffe model convert to onnx model☆175Updated 2 years ago
- VeriSilicon Tensor Interface Module☆234Updated 3 months ago
- heterogeneity-aware-lowering-and-optimization☆255Updated last year
- SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expert…☆121Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆858Updated 3 months ago
- arm-neon☆90Updated 8 months ago
- 动手学习TVM核心原理教程☆61Updated 4 years ago
- Yinghan's Code Sample☆323Updated 2 years ago
- Model Quantization Benchmark☆799Updated this week
- A nnie quantization aware training tool on pytorch.☆239Updated 4 years ago
- Compiler Infrastructure for Neural Networks☆145Updated last year
- ☆106Updated 4 years ago
- TensorLayerX: A Unified Deep Learning and Reinforcement Learning Framework for All Hardwares, Backends and OS.☆528Updated 4 months ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆981Updated 7 months ago
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆221Updated 6 months ago