alibaba / ai-matrix
To make it easy to benchmark AI accelerators
☆183Updated 2 years ago
Alternatives and similar repositories for ai-matrix:
Users that are interested in ai-matrix are comparing it to the libraries listed below
- heterogeneity-aware-lowering-and-optimization☆254Updated last year
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆533Updated 2 years ago
- Computation using data flow graphs for scalable machine learning☆67Updated this week
- Place for meetup slides☆140Updated 4 years ago
- ☆127Updated 7 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated 2 weeks ago
- A home for the final text of all TVM RFCs.☆103Updated 5 months ago
- DeepLearning Framework Performance Profiling Toolkit☆285Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆197Updated 2 years ago
- Automated machine learning as an AI-HPC benchmark☆66Updated 2 years ago
- tophub autotvm log collections☆70Updated 2 years ago
- ☆125Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- Documentation for StreamExecutor open source proposal☆83Updated 8 years ago
- examples for tvm schedule API☆99Updated last year
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago
- This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark.☆31Updated last year
- Benchmark of TVM quantized model on CUDA☆111Updated 4 years ago
- This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning fram…☆41Updated 4 years ago
- TVM integration into PyTorch☆452Updated 5 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- ☆24Updated 6 years ago
- OneFlow models for benchmarking.☆105Updated 7 months ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆267Updated last year
- ☆194Updated last year
- TensorFlow and TVM integration☆37Updated 4 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆82Updated 2 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆176Updated 2 years ago
- Dive into Deep Learning Compiler☆647Updated 2 years ago