jrcichra / rocm-pytorch-gfx803Links
Copy of rocm/pytorch with gfx803 cards compiled in (see https://github.com/xuhuisheng/rocm-build/blob/develop/docs/gfx803.md)
☆20Updated 4 years ago
Alternatives and similar repositories for rocm-pytorch-gfx803
Users that are interested in rocm-pytorch-gfx803 are comparing it to the libraries listed below
Sorting:
- A Docker image based on rocm/pytorch with support for gfx803(Polaris 20-21 (XT/PRO/XL); RX580; RX570; RX560) and Python 3.8☆24Updated 2 years ago
- A install guide for the RX580☆37Updated 4 years ago
- ☆226Updated 2 years ago
- Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS☆63Updated last year
- build scripts for ROCm☆186Updated last year
- Install guide of ROCm and Tensorflow on Ubuntu for the RX580☆125Updated 8 months ago
- ☆37Updated 2 years ago
- ROCm docker images with fixes/support for extra architectures, such as gfx803/gfx1010.☆30Updated last year
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆132Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆30Updated this week
- 8-bit CUDA functions for PyTorch☆53Updated 3 weeks ago
- Deep Learning Primitives and Mini-Framework for OpenCL☆197Updated 8 months ago
- ROCm docker images with fixes/support for legecy architecture gfx803. eg.Radeon RX 590/RX 580/RX 570/RX 480☆64Updated 2 weeks ago
- Stable Diffusion GUI written in C++☆58Updated last month
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Updated last year
- Run Large Language Models on RK3588 with GPU-acceleration☆104Updated last year
- DLPrimitives/OpenCL out of tree backend for pytorch☆350Updated 9 months ago
- ☆326Updated 2 months ago
- Make PyTorch models at least run on APUs.☆55Updated last year
- Use safetensors with ONNX 🤗☆61Updated 3 months ago
- ☆56Updated 2 years ago
- llama.cpp fork used by GPT4All☆55Updated 3 months ago
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆94Updated last week
- rocDecode is a high performance video decode SDK for AMD hardware☆26Updated this week
- ☆16Updated 2 years ago
- Run Pytorch with ROCm hardware acceleration on an RX590 (or similar GPU)☆23Updated 2 years ago
- AMD related optimizations for transformer models☆77Updated 7 months ago
- a simple Flash Attention v2 implementation with ROCM (RDNA3 GPU, roc wmma), mainly used for stable diffusion(ComfyUI) in Windows ZLUDA en…☆43Updated 9 months ago
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆49Updated 2 years ago
- ☆20Updated this week