pomoke / torch-apu-helperLinks
Make PyTorch models at least run on APUs.
☆56Updated last year
Alternatives and similar repositories for torch-apu-helper
Users that are interested in torch-apu-helper are comparing it to the libraries listed below
Sorting:
- ☆63Updated 6 months ago
- ☆496Updated this week
- build scripts for ROCm☆188Updated last year
- ☆235Updated 2 years ago
- ☆418Updated 8 months ago
- 8-bit CUDA functions for PyTorch☆68Updated 2 months ago
- ☆53Updated last year
- My develoopment fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆110Updated 3 weeks ago
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆216Updated last week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆260Updated last week
- Because RKNPU only knows 4D☆39Updated last year
- ☆64Updated last year
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆106Updated 7 months ago
- ☆48Updated 2 years ago
- Download models from the Ollama library, without Ollama☆115Updated last year
- Deep Learning Primitives and Mini-Framework for OpenCL☆205Updated last year
- No-code CLI designed for accelerating ONNX workflows☆219Updated 5 months ago
- Customized ACPI method for overriding mobile AMD APU STAPM values☆41Updated 6 years ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆84Updated last week
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆51Updated 2 years ago
- AMD related optimizations for transformer models☆96Updated last month
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Updated 10 months ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆65Updated 7 months ago
- ☆56Updated 2 years ago
- ☆18Updated 11 months ago
- ☆171Updated last week
- Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS☆68Updated 2 years ago
- ROCm docker images with fixes/support for legecy architecture gfx803. eg.Radeon RX 590/RX 580/RX 570/RX 480☆77Updated 6 months ago
- NVIDIA Linux open GPU with P2P support☆94Updated this week
- Running SXM2/SXM3/SXM4 NVidia data center GPUs in consumer PCs☆132Updated 2 years ago