pomoke / torch-apu-helperLinks
Make PyTorch models at least run on APUs.
☆54Updated last year
Alternatives and similar repositories for torch-apu-helper
Users that are interested in torch-apu-helper are comparing it to the libraries listed below
Sorting:
- ☆59Updated 2 months ago
- Reverse engineering the rk3588 npu☆89Updated last year
- ☆54Updated last year
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆101Updated 2 months ago
- ☆233Updated 2 years ago
- build scripts for ROCm☆186Updated last year
- Because RKNPU only knows 4D☆36Updated last year
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆97Updated 2 weeks ago
- NVIDIA Linux open GPU with P2P support☆25Updated last month
- Deep Learning Primitives and Mini-Framework for OpenCL☆199Updated 10 months ago
- Download models from the Ollama library, without Ollama☆89Updated 8 months ago
- ☆37Updated 2 years ago
- ☆42Updated 2 years ago
- Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS☆64Updated last year
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆209Updated 4 months ago
- Lightweight Inference server for OpenVINO☆188Updated this week
- Running SXM2/SXM3/SXM4 NVidia data center GPUs in consumer PCs☆115Updated 2 years ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆72Updated 5 months ago
- ☆56Updated 2 years ago
- Customized ACPI method for overriding mobile AMD APU STAPM values☆37Updated 6 years ago
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Updated last year
- A set of utilities for monitoring and customizing GPU performance☆153Updated last year
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆50Updated 2 years ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆52Updated 2 months ago
- AMD related optimizations for transformer models☆80Updated 3 weeks ago
- GPU Power and Performance Manager☆60Updated 9 months ago
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆230Updated last week
- Kernel/Qemu Patches for Venus.☆32Updated 11 months ago
- Local LLM Server with GPU and NPU Acceleration☆206Updated this week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆89Updated last month