olealgoritme / gddr6Links
Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.
☆98Updated last month
Alternatives and similar repositories for gddr6
Users that are interested in gddr6 are comparing it to the libraries listed below
Sorting:
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆42Updated 3 weeks ago
- ☆41Updated 2 years ago
- 8-bit CUDA functions for PyTorch☆53Updated 3 weeks ago
- ☆326Updated 2 months ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆19Updated 8 months ago
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Updated last year
- Running SXM2/SXM3/SXM4 NVidia data center GPUs in consumer PCs☆109Updated last year
- build scripts for ROCm☆186Updated last year
- ☆70Updated 5 months ago
- ☆130Updated 2 months ago
- Fast and memory-efficient exact attention☆173Updated this week
- a simple Flash Attention v2 implementation with ROCM (RDNA3 GPU, roc wmma), mainly used for stable diffusion(ComfyUI) in Windows ZLUDA en…☆43Updated 9 months ago
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆204Updated 3 months ago
- Simple monkeypatch to boost AMD Navi 3 GPUs☆42Updated last month
- Make PyTorch models at least run on APUs.☆55Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆153Updated last year
- llama.cpp fork with additional SOTA quants and improved performance☆519Updated this week
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆67Updated 6 months ago
- ☆226Updated 2 years ago
- ☆75Updated this week
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆385Updated this week
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆49Updated 2 years ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆132Updated this week
- Train Llama Loras Easily☆30Updated last year
- Benchmark your GPU with ease☆19Updated last week
- AMD related optimizations for transformer models☆77Updated 7 months ago
- QuIP quantization☆52Updated last year
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆94Updated 10 months ago
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆86Updated last month
- Stable Diffusion and Flux in pure C/C++☆15Updated this week