apache / tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
☆11,978 Updated this week
Alternatives and similar repositories for tvm:
Users who are interested in tvm compare it to the libraries listed below:
- Open standard for machine learning interoperability ☆18,369 Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,363 Updated last week
- Development repository for the Triton language and compiler ☆14,294 Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli… ☆20,785 Updated last year
- Compiler for Neural Network hardware accelerators ☆3,267 Updated 8 months ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆11,134 Updated last week
- oneAPI Deep Neural Network Library (oneDNN) ☆3,712 Updated this week
- An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model c… ☆14,115 Updated 7 months ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆8,684 Updated this week
- Visualizer for neural network, deep learning and machine learning models ☆29,271 Updated this week
- NumPy & SciPy for GPU ☆9,739 Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator ☆15,477 Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit. ☆2,965 Updated this week
- Transformer related optimization, including BERT, GPT ☆6,003 Updated 10 months ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆86,504 Updated this week
- MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Co… ☆5,806 Updated 8 months ago
- TensorFlow's Visualization Toolkit ☆6,777 Updated last week
- Simplify your onnx model ☆3,970 Updated 5 months ago
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes. ☆28,905 Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆2,915 Updated this week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. ☆6,556 Updated this week
- Build and run Docker containers leveraging NVIDIA GPUs ☆17,314 Updated last year
- Optimized primitives for collective multi-GPU communication ☆3,426 Updated last week
- a language for fast, portable data-parallel computation ☆5,961 Updated this week
- ☆1,656 Updated 6 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,268 Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. ☆35,227 Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,525 Updated this week
- AutoML library for deep learning ☆9,196 Updated last month
- A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy. ☆10,081 Updated 8 months ago