alibaba / ai-matrix
To make it easy to benchmark AI accelerators
☆182Updated 2 years ago
Alternatives and similar repositories for ai-matrix:
Users that are interested in ai-matrix are comparing it to the libraries listed below
- heterogeneity-aware-lowering-and-optimization☆254Updated 11 months ago
- Place for meetup slides☆140Updated 4 years ago
- A home for the final text of all TVM RFCs.☆101Updated 3 months ago
- Computation using data flow graphs for scalable machine learning☆67Updated this week
- DeepLearning Framework Performance Profiling Toolkit☆281Updated 2 years ago
- Issues related to MLPerf™ training policies, including rules and suggested changes☆94Updated last month
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆196Updated 2 years ago
- oneflow documentation☆68Updated 6 months ago
- OneFlow models for benchmarking.☆105Updated 5 months ago
- tophub autotvm log collections☆70Updated 2 years ago
- TVM integration into PyTorch☆453Updated 5 years ago
- High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.☆531Updated 2 years ago
- Automated machine learning as an AI-HPC benchmark☆64Updated 2 years ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆266Updated last year
- ☆196Updated last year
- examples for tvm schedule API☆98Updated last year
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆175Updated 2 years ago
- 动手学习TVM核心原理教程☆59Updated 4 years ago
- ☆30Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆37Updated 10 months ago
- This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning fram…☆41Updated 4 years ago
- Benchmark of TVM quantized model on CUDA☆111Updated 4 years ago
- This repository contains the results and code for the MLPerf™ Inference v0.5 benchmark.☆55Updated last year
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆215Updated this week
- ☆127Updated 6 years ago
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆90Updated last year
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆73Updated 4 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago