aieater / rocm_pytorch_informations
The official ROCm/PyTorch page often contains confusing information. This page aims to provide accurate information based on knowledge gained from developing the GPUEater infrastructure.
☆87 Updated 4 years ago
Alternatives and similar repositories for rocm_pytorch_informations:
Users who are interested in rocm_pytorch_informations compare it to the libraries listed below:
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆221 Updated this week
- Nod.ai 🦈 version of 👻. You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆106 Updated 2 months ago
- Fast Block Sparse Matrices for PyTorch ☆546 Updated 4 years ago
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1 ☆136 Updated 2 years ago
- Large Model Support in PyTorch ☆133 Updated 3 years ago
- PyTorch implementation of L2L execution algorithm ☆107 Updated 2 years ago
- Productionize machine learning predictions, with ONNX or without ☆65 Updated last year
- Benchmarks of well-known CNN models in PyTorch on various GPUs. ☆234 Updated 9 months ago
- ☆15 Updated 3 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre… ☆322 Updated last week
- NVIDIA GPU tools - monitoring on CLI & web app with multiple agents ☆87 Updated 10 months ago
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax ☆128 Updated last year
- Torch Distributed Experimental ☆115 Updated 7 months ago
- ☆74 Updated last year
- Code for scaling Transformers ☆26 Updated 4 years ago
- Lite Inference Toolkit (LIT) for PyTorch ☆161 Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆126 Updated 4 years ago
- ☆30 Updated 4 years ago
- Concise deep learning for JAX ☆184 Updated 4 years ago
- GPU fan control for headless Linux ☆340 Updated last year
- Functional deep learning ☆108 Updated 2 years ago
- A Pytree Module system for Deep Learning in JAX ☆213 Updated 2 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure ☆106 Updated last year
- A queue service for quickly developing scripts that use all your GPUs efficiently ☆83 Updated 2 years ago
- PyTorch and JAX code for the Madam optimiser. ☆51 Updated 4 years ago
- Accelerate PyTorch models with ONNX Runtime ☆358 Updated last month
- Partial implementation of NVIDIA® cuDNN API for Coriander, OpenCL 1.2 ☆22 Updated 7 years ago
- ☆53 Updated 4 years ago
- Template repository for a Python 3-based data science project that uses Horovod. ☆43 Updated 3 years ago
- Customized matrix multiplication kernels ☆54 Updated 3 years ago