OAID / AutoKernel
AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。
☆736Updated 2 years ago
Alternatives and similar repositories for AutoKernel:
Users that are interested in AutoKernel are comparing it to the libraries listed below
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,435Updated 5 months ago
- ☆243Updated last year
- Tengine Convert Tool supports converting multi framworks' models into tmfile that suitable for Tengine-Lite AI framework.☆92Updated 3 years ago
- TVM Documentation in Chinese Simplified / TVM 中文文档☆805Updated this week
- 🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.☆742Updated last year
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆498Updated 3 months ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆474Updated 3 months ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆939Updated 6 months ago
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆532Updated 2 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆394Updated 2 years ago
- VeriSilicon Tensor Interface Module☆229Updated last month
- Compiler Infrastructure for Neural Networks☆145Updated last year
- TensorRT Plugin Autogen Tool☆369Updated last year
- heterogeneity-aware-lowering-and-optimization☆254Updated last year
- SuperSonic, a new open-source framework to allow compiler developers to integrate RL into compilers easily, regardless of their RL expert…☆121Updated last year
- arm-neon☆89Updated 6 months ago
- ☆95Updated 3 years ago
- caffe model convert to onnx model☆174Updated 2 years ago
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆178Updated last year
- FlagPerf is an open-source software platform for benchmarking AI chips.☆323Updated 2 weeks ago
- ☆107Updated 4 years ago
- row-major matmul optimization☆602Updated last year
- 【深度学习模型部署框架】支持tf/torch/trt/trtllm/vllm以及更多nn框架,支持dynamic batching、streaming模式,支持python/c++双语言,可限制,可拓展,高性能。帮助用户快速地将模型部署到线上,并通过http/rpc接口方式…☆155Updated last month
- Edge Machine Learning Library☆193Updated 2 years ago
- A library for high performance deep learning inference on NVIDIA GPUs.☆552Updated 3 years ago
- Yinghan's Code Sample☆305Updated 2 years ago
- A primitive library for neural network☆1,311Updated 2 months ago
- CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/Solu…☆49Updated last year
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆215Updated 4 months ago