AMD-AGI/GEAK

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AMD-AGI/GEAK)

AMD-AGI / GEAK

Generating Efficient AI-Centric Kernels

☆131

Alternatives and similar repositories for GEAK

Users that are interested in GEAK are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AMD-AGI / TraceLens
View on GitHub
Automating analysis from trace files
☆84Updated this week
meta-pytorch / KernelAgent
View on GitHub
Autonomous GPU Kernel Generation & Optimization via Deep Agents
☆490Jul 15, 2026Updated last week
AMD-AGI / Magpie
View on GitHub
A lightweight, general-purpose framework for evaluating GPU kernel and benchmark.
☆57Updated this week
ROCm / TransformerEngine
View on GitHub
☆72Updated this week
ROCm / FlyDSL
View on GitHub
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆249Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
thunlp / TritonBench
View on GitHub
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
☆137Jun 14, 2025Updated last year
AMDResearch / intellikit
View on GitHub
IntelliKit is a collection of intelligent tools designed to make GPU kernel development, profiling, and validation accessible to LLMs and…
☆27Updated this week
ROCm / aiter
View on GitHub
AI Tensor Engine for ROCm
☆503Updated this week
ROCm / rocprof-compute-viewer
View on GitHub
☆62Jul 16, 2026Updated last week
AMDResearch / intelliperf
View on GitHub
Automated bottleneck detection and solution orchestration
☆23Feb 24, 2026Updated 5 months ago
ScalingIntelligence / KernelBench
View on GitHub
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
☆1,156Mar 24, 2026Updated 4 months ago
ROCm / ATOM
View on GitHub
AiTer Optimized Model
☆144Updated this week
flashinfer-ai / flashinfer-bench
View on GitHub
Building the Virtuous Cycle for AI-driven LLM Systems
☆261May 1, 2026Updated 2 months ago
AMD-AGI / Primus-SaFE
View on GitHub
Primus-SaFE(Stability and Fault Endurance)
☆58Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ROCm / aotriton
View on GitHub
Ahead of Time (AOT) Triton Math Library
☆100Updated this week
hkust-nlp / KernelGYM
View on GitHub
[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations [ICML…
☆196Mar 29, 2026Updated 3 months ago
cross-entropy-ai / deck
View on GitHub
Tmux sidebar for vibe coding. Manage sessions and monitor agents at a glance
☆15Jul 19, 2026Updated last week
ROCm / rocmProfileData
View on GitHub
☆30Updated this week
ScalingIntelligence / good-kernels
View on GitHub
Samples of good AI generated CUDA kernels
☆106May 30, 2025Updated last year
SakanaAI / robust-kbench
View on GitHub
☆101Nov 22, 2025Updated 8 months ago
NVIDIA / SOL-ExecBench
View on GitHub
A benchmark of real-world DL kernel problems
☆263Jul 15, 2026Updated last week
meta-pytorch / tritonbench
View on GitHub
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
☆362Updated this week
caoshiyi / K-Search
View on GitHub
Automated High-Performance GPU Kernel Generation
☆120Jun 1, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zhang677 / AccelOpt
View on GitHub
[MLSys 2026] AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization
☆57Jun 18, 2026Updated last month
wafer-ai / kernel-arena
View on GitHub
Public benchmark results from Kernel Arena, a leaderboard for LLM-generated AI accelerator kernels.
☆21Mar 11, 2026Updated 4 months ago
wzzll123 / MultiKernelBench
View on GitHub
MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation
☆64Jul 8, 2026Updated 2 weeks ago
HazyResearch / HipKittens
View on GitHub
Fast and Furious AMD Kernels
☆446Jul 10, 2026Updated 2 weeks ago
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Jul 14, 2026Updated last week
RightNow-AI / autokernel
View on GitHub
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
☆1,479Mar 19, 2026Updated 4 months ago
ROCm / composable_kernel
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
☆539Updated this week
ROCm / rocm-systems
View on GitHub
super repo for rocm systems projects
☆443Updated this week
luongthecong123 / fp8-quant-matmul
View on GitHub
Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.
☆19Feb 9, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Xtra-Computing / raintreebook
View on GitHub
A book about Ph.D. student and research career planning
☆29Oct 21, 2025Updated 9 months ago
ROCm / iris
View on GitHub
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆193Updated this week
ROCm / MAD
View on GitHub
MAD (Model Automation and Dashboarding)
☆39Updated this week
ScalingIntelligence / caesar
View on GitHub
Throughput-oriented multi-turn inference engine for KernelBench [ICML '25]
☆24May 27, 2025Updated last year
AMD-AGI / Primus-Turbo
View on GitHub
A high-performance acceleration library dedicated to large-scale model training on AMD GPUs
☆67Updated this week
KernelFlow-ops / cuda-optimized-skill
View on GitHub
A CUDA kernel optimization toolkit for validation, benchmarking, Nsight Compute profiling, bottleneck analysis, and iterative tuning. It …
☆191Apr 22, 2026Updated 3 months ago
AMD-AGI / Primus
View on GitHub
A flexible and high-performance training framework designed for large-scale foundation model training on AMD GPUs
☆108Updated this week