apache / tvmLinks
Open Machine Learning Compiler Framework
☆13,005Updated this week
Alternatives and similar repositories for tvm
Users that are interested in tvm are comparing it to the libraries listed below
Sorting:
- Compiler for Neural Network hardware accelerators☆3,323Updated last year
- Open standard for machine learning interoperability☆20,114Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,949Updated this week
- Development repository for the Triton language and compiler☆18,041Updated this week
- Transformer related optimization, including BERT, GPT☆6,378Updated last year
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,189Updated this week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆3,537Updated this week
- Optimized primitives for collective multi-GPU communication☆4,352Updated 2 weeks ago
- Ongoing research training transformer models at scale☆14,798Updated this week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,076Updated this week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,590Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,880Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,644Updated last month
- Tutorials for creating and using ONNX models☆3,639Updated last year
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,213Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆18,841Updated this week
- Simplify your onnx model☆4,261Updated 4 months ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,546Updated 3 weeks ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆96,395Updated this week
- A list of awesome compiler projects and papers for tensor computation and deep learning.☆2,705Updated last year
- ☆1,974Updated 2 years ago
- A collection of pre-trained, state-of-the-art models in the ONNX format☆9,324Updated 3 months ago
- Low-precision matrix multiplication☆1,826Updated last year
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,829Updated 2 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆2,017Updated 7 years ago
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆7,195Updated last week
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM …☆13,851Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,693Updated 3 weeks ago
- A high performance and generic framework for distributed DNN training☆3,716Updated 2 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,888Updated 2 weeks ago