apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

☆12,492

Alternatives and similar repositories for tvm

Users who are interested in tvm are comparing it to the libraries listed below.

pytorch / glow
Compiler for Neural Network hardware accelerators
☆3,310 · Updated last year
onnx / onnx
Open standard for machine learning interoperability
☆19,345 · Updated this week
NVIDIA / TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…
☆11,912 · Updated last week
uxlfoundation / oneDNN
oneAPI Deep Neural Network Library (oneDNN)
☆3,856 · Updated this week
horovod / horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
☆14,558 · Updated this week
triton-lang / triton
Development repository for the Triton language and compiler
☆16,320 · Updated this week
NVIDIA / FasterTransformer
Transformer-related optimization, including BERT, GPT
☆6,261 · Updated last year
Tencent / ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
☆21,843 · Updated this week
microsoft / onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
☆17,369 · Updated this week
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,889 · Updated last week
iree-org / iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
☆3,241 · Updated last week
daquexian / onnx-simplifier
Simplify your onnx model
☆4,121 · Updated 10 months ago
openxla / xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
☆3,383 · Updated this week
halide / Halide
a language for fast, portable data-parallel computation
☆6,139 · Updated last week
openvinotoolkit / openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
☆8,647 · Updated this week
BBuf / tvm_mlir_learn
A collection of compiler learning resources.
☆2,457 · Updated 4 months ago
onnx / tutorials
Tutorials for creating and using ONNX models
☆3,577 · Updated last year
google / gemmlowp
Low-precision matrix multiplication
☆1,812 · Updated last year
microsoft / MMdnn
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…
☆5,810 · Updated 2 weeks ago
XiaoMi / mace
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.
☆5,017 · Updated last year
google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,072 · Updated last week
onnx / models
A collection of pre-trained, state-of-the-art models in the ONNX format
☆8,857 · Updated last month
IntelLabs / distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distille…
☆4,400 · Updated 2 years ago
triton-inference-server / server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
☆9,531 · Updated last week
NVIDIA / cutlass
CUDA Templates for Linear Algebra Subroutines
☆8,149 · Updated this week
flame / how-to-optimize-gemm
☆1,902 · Updated 2 years ago
Tencent / PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
☆2,910 · Updated 2 years ago
NVIDIA / DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…
☆5,475 · Updated this week
tensorflow / mlir
"Multi-Level Intermediate Representation" Compiler Infrastructure
☆1,752 · Updated 4 years ago
dmlc / nnvm
☆1,658 · Updated 6 years ago