aieater / rocm_pytorch_informations
The official ROCm/PyTorch page contains information that is consistently confusing. This page aims to provide accurate information based on knowledge gained from GPUEater infrastructure development.
☆87Updated 4 years ago
Alternatives and similar repositories for rocm_pytorch_informations
Users that are interested in rocm_pytorch_informations are comparing it to the libraries listed below
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆137Updated 3 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆238Updated this week
- Memory Efficient Attention (O(sqrt(n))) for Jax and PyTorch☆184Updated 2 years ago
- Accelerate PyTorch models with ONNX Runtime☆363Updated 6 months ago
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆236Updated 2 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 7 months ago
- Fast Block Sparse Matrices for Pytorch☆549Updated 4 years ago
- A small demonstration of using WebDataset with ImageNet and PyTorch Lightning☆75Updated last year
- Lite Inference Toolkit (LIT) for PyTorch☆161Updated 3 years ago
- Tensorflow Wheels☆135Updated 3 years ago
- ☆74Updated last year
- Simple gradient checkpointing for eager mode execution☆46Updated 4 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆129Updated 3 years ago
- Babysit your preemptible TPUs☆86Updated 2 years ago
- Haste: a fast, simple, and open RNN library☆333Updated 2 years ago
- Adaptive Gradient Clipping☆144Updated 2 years ago
- PyTorch implementation of L2L execution algorithm☆108Updated 2 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆106Updated 2 years ago
- Implementation of Feedback Transformer in Pytorch☆107Updated 4 years ago
- Make TFRecord Usable Again☆89Updated 2 years ago
- Pytorch Lightning Distributed Accelerators using Ray☆214Updated last year
- Large Model Support in PyTorch☆134Updated 3 years ago
- Library for 8-bit optimizers and quantization routines.☆777Updated 3 years ago
- Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.☆131Updated last year
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need".☆133Updated 3 years ago
- Torch Distributed Experimental☆117Updated last year
- ☆109Updated 4 years ago
- NVIDIA GPU tools - monitoring on CLI & web app with multiple agents☆89Updated last year
- A logging tool for deep learning.☆60Updated 5 months ago
- Benchmark Suite for Deep Learning☆274Updated 6 months ago