NVIDIA / TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
☆10,764 · Updated this week
Related projects
Alternatives and complementary repositories for TensorRT
- ONNX-TensorRT: TensorRT backend for ONNX ☆2,948 · Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆8,296 · Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,585 · Updated this week
- An easy-to-use PyTorch to TensorRT converter ☆4,599 · Updated 2 months ago
- Simplify your ONNX model ☆3,849 · Updated 2 months ago
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator ☆14,642 · Updated this week
- Open deep learning compiler stack for CPU, GPU and specialized accelerators ☆11,761 · Updated this week
- Open standard for machine learning interoperability ☆17,893 · Updated this week
- Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX ☆2,320 · Updated 2 months ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,392 · Updated last week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,139 · Updated this week
- A collection of pre-trained, state-of-the-art models in the ONNX format ☆7,919 · Updated 6 months ago
- Transformer related optimization, including BERT, GPT ☆5,871 · Updated 7 months ago
- Visualizer for neural network, deep learning and machine learning models ☆28,075 · Updated this week
- Development repository for the Triton language and compiler ☆13,311 · Updated this week
- Implementation of popular deep learning networks with TensorRT network definition API ☆6,984 · Updated 2 weeks ago
- Serve, optimize and scale PyTorch models in production ☆4,209 · Updated 2 weeks ago
- OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference ☆7,232 · Updated this week
- CUDA Templates for Linear Algebra Subroutines ☆5,629 · Updated this week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter… ☆13,529 · Updated 2 months ago
- Google Brain AutoML ☆6,245 · Updated 7 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆7,911 · Updated this week
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba ☆8,712 · Updated last week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆30,426 · Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform ☆20,436 · Updated this week
- OpenMMLab Computer Vision Foundation ☆5,892 · Updated this week
- Tutorials for creating and using ONNX models ☆3,372 · Updated 3 months ago
- Fast and memory-efficient exact attention ☆14,109 · Updated this week
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,373 · Updated last month
- AITemplate is a Python framework which renders neural networks into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,553 · Updated 2 weeks ago