apache / tvmLinks
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆12,613Updated this week
Alternatives and similar repositories for tvm
Users that are interested in tvm are comparing it to the libraries listed below
Sorting:
- Open standard for machine learning interoperability☆19,582Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,884Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,347Updated this week
- Development repository for the Triton language and compiler☆16,831Updated last week
- Compiler for Neural Network hardware accelerators☆3,311Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆93,116Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,144Updated last week
- a language for fast, portable data-parallel computation☆6,317Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,505Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆17,862Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,822Updated last year
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,786Updated this week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,978Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆22,039Updated this week
- CUDA Templates for Linear Algebra Subroutines☆8,427Updated last week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,503Updated last week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,592Updated last month
- Tutorials for creating and using ONNX models☆3,599Updated last year
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co…☆5,811Updated last month
- Transformer related optimization, including BERT, GPT☆6,300Updated last year
- Visualizer for neural network, deep learning and machine learning models☆31,377Updated this week
- Simplify your onnx model☆4,172Updated 3 weeks ago
- "Multi-Level Intermediate Representation" Compiler Infrastructure☆1,756Updated 4 years ago
- ☆1,655Updated 7 years ago
- Optimized primitives for collective multi-GPU communication☆4,051Updated this week
- NumPy & SciPy for GPU☆10,479Updated this week
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,113Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,448Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,800Updated last week
- Low-precision matrix multiplication☆1,816Updated last year