LLM4Kernel: A Survey of Large Language Models for GPU Kernel Development
☆66 · Mar 31, 2026 · Updated last month
Alternatives and similar repositories for Awesome-LLM4Kernel
Users who are interested in Awesome-LLM4Kernel are comparing it to the repositories listed below.
- Official Repo of CudaForge ☆78 · Dec 2, 2025 · Updated 5 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning ☆27 · Mar 30, 2026 · Updated last month
- From Minimal GEMM to Everything ☆202 · Feb 10, 2026 · Updated 2 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆16 · Sep 7, 2024 · Updated last year
- [EMNLP 2024] Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction ☆17 · Nov 9, 2024 · Updated last year
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/ ☆11 · Oct 26, 2025 · Updated 6 months ago
- ☆47 · Apr 7, 2026 · Updated last month
- Simplest online-softmax notebook for explaining Flash Attention ☆16 · Jan 27, 2026 · Updated 3 months ago
- A community-driven pypto implementation ☆63 · Updated this week
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming ☆11 · Nov 18, 2024 · Updated last year
- A curated list of PhD, RA, and Intern openings in Computer Science (CS), Electrical & Computer Engineering (ECE), and Artificial Intellig… ☆21 · Sep 1, 2025 · Updated 8 months ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022 ☆45 · Mar 22, 2024 · Updated 2 years ago
- Review automated kernel generation in the era of LLMs ☆191 · Apr 9, 2026 · Updated last month
- ☆18 · Apr 8, 2025 · Updated last year
- [AAAI'26] Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression ☆19 · Dec 21, 2025 · Updated 4 months ago
- ☆11 · Aug 4, 2022 · Updated 3 years ago
- Resources and code for LotteryCodec. ☆26 · Nov 3, 2025 · Updated 6 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 · Feb 11, 2025 · Updated last year
- Griffinfly is COSIC's submission to the ZPRIZE competition under the category, Accelerating NTT Operations on an FPGA by Michiel Van Beir… ☆11 · Feb 13, 2023 · Updated 3 years ago
- [TMM 2025] Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression ☆15 · Mar 28, 2025 · Updated last year
- Code for DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access. ☆11 · Apr 13, 2022 · Updated 4 years ago
- Notebooks for the HE introduction ☆10 · Sep 11, 2020 · Updated 5 years ago
- Unofficial PyTorch Implementation of Improving Inference for Neural Image Compression ☆15 · Apr 27, 2025 · Updated last year
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators ☆126 · Jun 14, 2025 · Updated 10 months ago
- This is my graduation project, a simple processor soft core, which implements RV32I ISA. ☆16 · May 23, 2019 · Updated 6 years ago
- Introduction to the Vitis accelerator deployment workflow ☆13 · Jan 10, 2025 · Updated last year
- ☆14 · Aug 18, 2025 · Updated 8 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 · Jun 21, 2019 · Updated 6 years ago
- Clustering by fast search and find of density peaks ☆13 · Jan 8, 2016 · Updated 10 years ago
- TensorRT-in-Action is a GitHub repository that provides code examples for using TensorRT, with accompanying Jupyter Notebooks. ☆15 · Jun 1, 2023 · Updated 2 years ago
- Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal ☆67 · May 2, 2026 · Updated last week
- Awesome LLM for NLG Evaluation Papers ☆26 · Jan 23, 2024 · Updated 2 years ago
- [ICCV 2025 Highlight] official code of paper "DLF: Extreme Image Compression with Dual-generative Latent Fusion" ☆45 · Dec 24, 2025 · Updated 4 months ago
- [TCSVT 2023] RDO-PTQ: Rate-Distortion Optimized Post-Training Quantization for Learned Image Compression ☆19 · Nov 1, 2023 · Updated 2 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust. ☆13 · Nov 23, 2024 · Updated last year
- ☆21 · Sep 10, 2021 · Updated 4 years ago
- ☆31 · Oct 30, 2024 · Updated last year
- ☆12 · Mar 28, 2024 · Updated 2 years ago
- A boundary detection algorithm in microscopic images considering 3D information. ☆13 · Sep 19, 2018 · Updated 7 years ago