aieater / rocm_pytorch_informations

The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate information based on the knowledge gained by GPUEater infrastructure development.

☆87

Alternatives and similar repositories for rocm_pytorch_informations:

Users that are interested in rocm_pytorch_informations are comparing it to the libraries listed below

ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆223Updated this week
nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 3 months ago
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆545Updated 4 years ago
octoml / Apple-M1-BERT
3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1
☆136Updated 3 years ago
thomasbrandon / mish-cuda
Mish Activation Function for PyTorch
☆148Updated 4 years ago
r0mainK / outperformer
Code for scaling Transformers
☆26Updated 4 years ago
mit-han-lab / neurips-micronet
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
☆40Updated 4 years ago
TezRomacH / layer-to-layer-pytorch
PyTorch implementation of L2L execution algorithm
☆107Updated 2 years ago
ryujaehun / pytorch-gpu-benchmark
Using the famous cnn model in Pytorch, we run benchmarks on various gpu.
☆234Updated 10 months ago
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆359Updated 2 months ago
pytorch / nestedtensor
[Prototype] Tools for the concurrent manipulation of variably sized Tensors.
☆251Updated 2 years ago
shawwn / ml-notes
☆39Updated 2 years ago
adityaiitb / pyprof2
PyProf2: PyTorch Profiling tool
☆82Updated 4 years ago
cybertronai / pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
☆374Updated 4 years ago
IBM / pytorch-large-model-support
Large Model Support in PyTorch
☆133Updated 3 years ago
artyom-beilis / dlprimitives
Deep Learning Primitives and Mini-Framework for OpenCL
☆193Updated 7 months ago
szymonmaszke / torchdatasets
PyTorch dataset extended with map, cache etc. (tensorflow.data like)
☆329Updated 2 years ago
AminRezaei0x443 / PyTorch-LIT
Lite Inference Toolkit (LIT) for PyTorch
☆161Updated 3 years ago
huggingface / tune
☆87Updated 2 years ago
facebookresearch / diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …
☆235Updated last year
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆54Updated 3 years ago
kpu / intgemm
int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991
☆71Updated last year
NVlabs / tensorcom
☆109Updated 4 years ago
ofnote / tsalib
Tensor Shape Annotation Library (numpy, tensorflow, pytorch, ...)
☆265Updated 4 years ago
IntelLabs / SLIDE_opt_ia
☆74Updated last year
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆716Updated 2 years ago
NervanaSystems / ngraph-onnx
nGraph™ Backend for ONNX
☆42Updated 2 years ago
kartik4949 / deepops
a mini Deep Learning framework supporting GPU accelerations written with CUDA
☆32Updated 4 years ago
rossumai / nvgpu
NVIDIA GPU tools - monitoring on CLI & web app with multiple agents
☆87Updated 11 months ago
pytorch / extension-script
Example repository for custom C++/CUDA operators for TorchScript
☆114Updated 2 years ago