apache / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆11,761Updated this week
Related projects ⓘ
Alternatives and complementary repositories for tvm
- Open standard for machine learning interoperability☆17,893Updated this week
- Development repository for the Triton language and compiler☆13,311Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆14,642Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,619Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆10,764Updated this week
- Compiler for Neural Network hardware accelerators☆3,234Updated 5 months ago
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆30,426Updated this week
- Visualizer for neural network, deep learning and machine learning models☆28,075Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,247Updated 2 months ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,775Updated last year
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆8,296Updated this week
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba☆8,712Updated last week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆20,436Updated this week
- Simplify your onnx model☆3,849Updated 2 months ago
- A collection of pre-trained, state-of-the-art models in the ONNX format☆7,919Updated 6 months ago
- MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.☆4,932Updated 4 months ago
- Tutorials for creating and using ONNX models☆3,372Updated 3 months ago
- NumPy & SciPy for GPU☆9,449Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆2,690Updated this week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,139Updated this week
- CUDA Templates for Linear Algebra Subroutines☆5,629Updated this week
- AutoML library for deep learning☆9,145Updated this week
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,043Updated 4 months ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆1,865Updated this week
- TensorFlow's Visualization Toolkit☆6,712Updated 2 weeks ago
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,797Updated 5 months ago
- Optimized primitives for collective multi-GPU communication☆3,227Updated last month
- Transformer related optimization, including BERT, GPT☆5,871Updated 7 months ago
- a language for fast, portable data-parallel computation☆5,890Updated this week
- The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologi…☆2,845Updated last month