kcxain/Awesome-LLM4Kernel

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kcxain/Awesome-LLM4Kernel)

kcxain / Awesome-LLM4Kernel

LLM4Kernel: A Survey of Large Language Models for GPU Kernel Development

☆76

Alternatives and similar repositories for Awesome-LLM4Kernel

Users that are interested in Awesome-LLM4Kernel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kcxain / Awesome-Kernel-Skills
View on GitHub
Public skills collected from well-known open-source projects focused on LLM infrastructure, GPU kernels, compiler/operator development
☆27May 7, 2026Updated 2 months ago
flagos-ai / awesome-LLM-driven-kernel-generation
View on GitHub
Review automated kernel generation in the era of LLMs
☆276Jun 25, 2026Updated last month
OptimAI-Lab / CudaForge
View on GitHub
Official Repo of CudaForge
☆85Dec 2, 2025Updated 7 months ago
0satan0 / KernelMem
View on GitHub
☆23Feb 14, 2026Updated 5 months ago
meta-pytorch / KernelAgent
View on GitHub
Autonomous GPU Kernel Generation & Optimization via Deep Agents
☆491Jul 15, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NVIDIA / SOL-ExecBench
View on GitHub
A benchmark of real-world DL kernel problems
☆265Jul 15, 2026Updated 2 weeks ago
ScalingIntelligence / KernelBench
View on GitHub
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
☆1,163Mar 24, 2026Updated 4 months ago
hkust-nlp / KernelGYM
View on GitHub
[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations [ICML…
☆196Mar 29, 2026Updated 4 months ago
meta-pytorch / popcorn-kernels
View on GitHub
For building the world's largest dataset of GPU kernels.
☆11Jul 17, 2026Updated last week
BytedTsinghua-SIA / CUDA-Agent
View on GitHub
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
☆1,123Jul 8, 2026Updated 3 weeks ago
YanjingLi0202 / Bi-ViT
View on GitHub
The official implementation of the AAAI 2024 paper Bi-ViT.
☆13Dec 18, 2023Updated 2 years ago
NVIDIA / compute-eval
View on GitHub
Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…
☆143May 19, 2026Updated 2 months ago
caoshiyi / K-Search
View on GitHub
Automated High-Performance GPU Kernel Generation
☆120Jun 1, 2026Updated last month
wzzll123 / MultiKernelBench
View on GitHub
MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation
☆66Jul 8, 2026Updated 3 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
AI9Stars / AutoTriton
View on GitHub
☆66Jul 14, 2025Updated last year
flashinfer-ai / flashinfer-bench
View on GitHub
Building the Virtuous Cycle for AI-driven LLM Systems
☆261May 1, 2026Updated 2 months ago
thomasjoshi / agents-never-forget
View on GitHub
☆18May 18, 2025Updated last year
zhaiyi000 / tlm
View on GitHub
☆49Jul 13, 2024Updated 2 years ago
NVlabs / SOLAR
View on GitHub
Speed of Light Analysis for ML Model Runtime
☆108Jun 10, 2026Updated last month
mit-han-lab / KernelWiki
View on GitHub
☆314Jun 9, 2026Updated last month
BBuf / KDA-Pilot
View on GitHub
☆234Updated this week
KuangjuX / cu-x
View on GitHub
🎉My Collections of CUDA Kernels~
☆11Jun 25, 2024Updated 2 years ago
flagos-ai / FlagGems
View on GitHub
FlagGems is an operator library for large language models implemented in the Triton Language.
☆1,057Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Odysseusq / VLCache
View on GitHub
Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"
☆16Mar 28, 2026Updated 4 months ago
hossamfadeel / Verilog-Based-NoC-Simulator
View on GitHub
Verilog-Based-NoC-Simulator
☆12May 4, 2016Updated 10 years ago
NVlabs / KernelBlaster
View on GitHub
A framework for in context learning for code optimization
☆60Mar 14, 2026Updated 4 months ago
thunlp / TritonBench
View on GitHub
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
☆138Jun 14, 2025Updated last year
dsl-learn / triton-tutorial
View on GitHub
Getting Started with Triton: A Tutorial for Python Beginners
☆61Mar 26, 2026Updated 4 months ago
mit-han-lab / ncu-report-skill
View on GitHub
☆159May 24, 2026Updated 2 months ago
RightNow-AI / autokernel
View on GitHub
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
☆1,483Mar 19, 2026Updated 4 months ago
kwaipilot / SWE-Compass
View on GitHub
☆18Mar 28, 2026Updated 4 months ago
weishengying / cute_gemm
View on GitHub
☆23Aug 14, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sablin39 / tilelang-cuda-skills
View on GitHub
Skills for writing tilelang and debugging with CUDA toolkits.
☆133May 20, 2026Updated 2 months ago
CUDA-Bench / CUDABench
View on GitHub
☆16Mar 4, 2026Updated 4 months ago
uchuhimo / amanda
View on GitHub
☆18Apr 21, 2024Updated 2 years ago
FuyuWang / Soter
View on GitHub
☆13Jan 7, 2025Updated last year
baco-authors / baco
View on GitHub
☆17Dec 8, 2023Updated 2 years ago
OpenAgentEval / SWE-ABS
View on GitHub
[ICML 2026] SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark
☆22May 6, 2026Updated 2 months ago
KuangjuX / ncu-cli
View on GitHub
Automated CUDA kernel performance diagnostics from NVIDIA Nsight Compute (NCU) CSV exports.
☆34Mar 18, 2026Updated 4 months ago