nlzy/triton-gfx906


nlzy / triton-gfx906

triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60

☆48

Alternatives and similar repositories for triton-gfx906

Users interested in triton-gfx906 are comparing it to the repositories listed below.


nlzy / vllm-gfx906
vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆433 · Feb 20, 2026 · Updated 5 months ago
mixa3607 / ML-gfx906
ML software (llama.cpp, ComfyUI, vLLM) builds for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆294 · Updated this week
ai-infos / vllm-gfx906-mobydick
A high-throughput and memory-efficient inference and serving engine for LLMs - Optimized for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI…
☆77 · Jun 23, 2026 · Updated last month
iacopPBK / llama.cpp-gfx906
llama.cpp-gfx906
☆139 · Mar 22, 2026 · Updated 4 months ago
Said-Akbar / vllm-rocm
FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs
☆71 · May 4, 2025 · Updated last year
Said-Akbar / triton-gcn5
Triton for AMD MI25/50/60. Development repository for the Triton language and compiler
☆34 · Dec 15, 2025 · Updated 7 months ago
PowerfulGhost / vllm-mi50
Modified for AMD MI50 GPUs | A high-throughput and memory-efficient inference and serving engine for LLMs
☆15 · Mar 2, 2026 · Updated 4 months ago
sh1ma / voicevoxcore.go
Wrapper library for Voicevox Core
☆19 · May 20, 2024 · Updated 2 years ago
ai-infos / guidances-setup-16-mi50-deepseek-v32
Guidances for Test setup of 16 AMD MI50 32GB (for Deepseek v3.2)
☆27 · May 11, 2026 · Updated 2 months ago
Teachings / FastAgentAPI
Proxy for OpenAI
☆16 · Sep 2, 2025 · Updated 10 months ago
mingyi456 / ComfyUI-DFloat11-Extended
Fork of the official DF11 ComfyUI custom node that aims to support other model architectures, and add support for LoRAs (selected models …
☆54 · Jul 16, 2026 · Updated last week
paudley / ai-notes
Random AI notes for working with local models or playing around with random machine learning bits.
☆61 · Jun 7, 2026 · Updated last month
poad42 / cuda-fp8-ampere
IMMA-based **FP8-as-storage** GEMM experiments for Ampere (sm_86 / RTX 3090 Ti).
☆24 · Jan 30, 2026 · Updated 6 months ago
pocke / goevent
goevent is event dispatcher written by golang.
☆13 · Jan 17, 2015 · Updated 11 years ago
jumppad-labs / hclconfig
Configuration parser for the HashiCorp Configuration Language (HCL)
☆13 · Jul 15, 2026 · Updated 2 weeks ago
sasha0552 / pascal-pkgs-ci
The main repository for building Pascal-compatible versions of ML applications and libraries.
☆214 · Aug 23, 2025 · Updated 11 months ago
ZigEmbeddedGroup / aviron
Configurable AVR simulator
☆17 · Apr 20, 2025 · Updated last year
amd / ZenDNN-pytorch-plugin
☆33 · Updated this week
open-o11y / opentelemetry-collector-testing
☆18 · Nov 2, 2020 · Updated 5 years ago
Mustafa-Esoofally / ML-assistant-Crew
☆14 · Sep 4, 2024 · Updated last year
Green0-0 / propagate
Evolutionary strategies finetuning library for LLMs
☆25 · Jun 29, 2026 · Updated last month
Felliks / DoomVLM
AI plays Doom — pit Vision Language Models against demons and each other. Solo scenarios, deathmatch arena, 1-4 agents with any OpenAI-co…
☆20 · Mar 12, 2026 · Updated 4 months ago
RAZZULLIX / fast_topk_batched
High-performance batched Top-K selection for CPU inference. Up to 80x faster than PyTorch, optimized for LLM sampling with AVX2 SIMD.
☆18 · Mar 20, 2026 · Updated 4 months ago
immohitsen / RAG-Chat
A premium RAG-based AI Assistant built with React and FastAPI. Features efficient document indexing and high-accuracy retrieval-augmented…
☆18 · Jun 3, 2026 · Updated last month
FairyDevicesRD / thinklet.squid.run
Stream directly from THINKLET to YouTube Live
☆10 · Dec 10, 2024 · Updated last year
sorryhyun / ComfyUI-Spectrum-KSampler
☆27 · Jul 20, 2026 · Updated last week
codysnider / tagmem
Structured local memory storage and retrieval for LLM agents
☆15 · May 19, 2026 · Updated 2 months ago
gschaidergabriel / lcme
Neural-enhanced conversational memory for AI agents — 10 micro-networks, tri-hybrid storage, bio-inspired retrieval
☆15 · Mar 27, 2026 · Updated 4 months ago
ParmesanParty / llama.cpp
Specialized fork for (relatively) fast single-GPU inference (in CUDA) using large MoE models that don't fit fully into VRAM
☆17 · May 6, 2026 · Updated 2 months ago
ikawrakow / ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
☆2,969 · Updated this week
waybarrios / dgx-spark-finetune-llm
LLM fine-tuning with LoRA + NVFP4/MXFP8 on NVIDIA DGX Spark (Blackwell GB10)
☆21 · Dec 22, 2025 · Updated 7 months ago
xinKyy / react-native-kline-chart
High-performance K-line (Candlestick) chart for React Native, powered by Skia. Smooth, customizable, and built for real trading apps.
☆19 · Mar 20, 2026 · Updated 4 months ago
jllllll / bitsandbytes
8-bit CUDA functions for PyTorch
☆27 · Nov 18, 2023 · Updated 2 years ago
amoghmunikote / 170th-Street
The most comprehensive community resource for the NVIDIA CMP 170HX.
☆48 · Updated this week
plctlab / riscv-cluster
Towards a million-node RISC-V cluster.
☆14 · Mar 6, 2025 · Updated last year
skyne98 / llama.cpp-gfx906
LLM inference in C/C++, but for GFX906!
☆19 · Jul 15, 2026 · Updated 2 weeks ago
ahyatt / llm-buddy
☆26 · Updated this week
HadarDavidson / colored-noise-sampling
Official Implementation of "Colored Noise Diffusion Sampling"
☆39 · Jun 1, 2026 · Updated last month
i-evi / sse2msa
A C/C++ header file that converts Intel SSE intrinsics to MIPS/MIPS64 MSA intrinsics.
☆10 · Nov 16, 2021 · Updated 4 years ago