superlich7 / caffe
This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning framework when running on Machine Learning Unit(MLU).
☆41Updated 4 years ago
Alternatives and similar repositories for caffe:
Users that are interested in caffe are comparing it to the libraries listed below
- examples for tvm schedule API☆100Updated last year
- 动手学习TVM核心原理教程☆61Updated 4 years ago
- ☆29Updated last year
- tophub autotvm log collections☆70Updated 2 years ago
- heterogeneity-aware-lowering-and-optimization☆255Updated last year
- code reading for tvm☆76Updated 3 years ago
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .☆110Updated this week
- CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/Solu…☆51Updated last week
- Development repository for the Triton-Linalg conversion☆182Updated last month
- ☆38Updated 3 years ago
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago
- A home for the final text of all TVM RFCs.☆102Updated 6 months ago
- TVM tutorial☆66Updated 6 years ago
- ☆95Updated 3 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆519Updated 4 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆176Updated 2 years ago
- ☆145Updated 2 months ago
- ☆25Updated 11 months ago
- symmetric int8 gemm☆66Updated 4 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Benchmark of TVM quantized model on CUDA☆111Updated 4 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆196Updated 2 years ago
- Fast CUDA Kernels for ResNet Inference.☆173Updated 5 years ago
- Place for meetup slides☆140Updated 4 years ago
- ☆17Updated 4 years ago
- NART = NART is not A RunTime, a deep learning inference framework.☆38Updated 2 years ago
- CUDA PTX-ISA Document 中文翻译版☆37Updated 2 weeks ago
- To make it easy to benchmark AI accelerators☆183Updated 2 years ago
- 14 basic topics for VEGA64 performance optmization☆54Updated 4 years ago