LLM4Kernel: A Survey of Large Language Models for GPU Kernel Development
☆62Mar 31, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-LLM4Kernel
Users that are interested in Awesome-LLM4Kernel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repo of CudaForge☆76Dec 2, 2025Updated 4 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆27Mar 30, 2026Updated 2 weeks ago
- From Minimal GEMM to Everything☆194Feb 10, 2026Updated 2 months ago
- The official implementation of the AAAI 2024 paper Bi-ViT.☆13Dec 18, 2023Updated 2 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- simplest online-softmax notebook for explain Flash Attention☆16Jan 27, 2026Updated 2 months ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming☆11Nov 18, 2024Updated last year
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆45Mar 22, 2024Updated 2 years ago
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Nov 20, 2024Updated last year
- ☆11Aug 4, 2022Updated 3 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- Griffinfly is COSIC's submission to the ZPRIZE competition under the category, Accelerating NTT Operations on an FPGA by Michiel Van Beir…☆11Feb 13, 2023Updated 3 years ago
- Notebooks for the HE introduction☆10Sep 11, 2020Updated 5 years ago
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators☆121Jun 14, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Jul 17, 2020Updated 5 years ago
- ☆14Aug 18, 2025Updated 7 months ago
- Vitis 部署加速器工作流介绍☆13Jan 10, 2025Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 2 years ago
- ☆33Oct 13, 2025Updated 6 months ago
- [ICLR 2026] Official Implementation of "FeatureBench: Benchmarking Agentic Coding for Complex Feature Development"☆48Mar 31, 2026Updated 2 weeks ago
- introduce AI infra knowledges. 人工智能系统基础架构知识库☆16Jun 4, 2023Updated 2 years ago
- An implementation of SGEMV with performance comparable to cuBLAS.☆12May 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- ☆18Mar 4, 2025Updated last year
- Community contributions to the OpenFHE project☆14Mar 8, 2025Updated last year
- Automated High-Performance GPU Kernel Generation☆95Updated this week
- Intel HEXL library backend for OpenFHE, which uses AVX-512 instructions to accelerate the execution of OpenFHE cryptographic capabilities…☆18Mar 12, 2026Updated last month
- Multi-GPU acceleration for Fully Homomorphic Encryption☆23Jun 3, 2024Updated last year
- 哈工大 2021 秋季学期《数据结构与算法》课程作业与实验 | HIT Data Structure 2021☆18Oct 4, 2022Updated 3 years ago
- Output high level Pcode (PcodeAST) in Ghidra☆16Apr 7, 2023Updated 3 years ago
- ☆24Apr 10, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels☆157Apr 3, 2026Updated 2 weeks ago
- Compiler Principle Lab In NJU☆15Dec 24, 2017Updated 8 years ago
- Support material for the Building Accelerated Applications with Vitis webinar series☆17Sep 10, 2020Updated 5 years ago
- Homework of CMU 10-414/714: Deep Learning Systems (https://dlsyscourse.org/)☆15Mar 21, 2024Updated 2 years ago
- Economics of Ransomware | Dataset☆15May 2, 2018Updated 7 years ago
- A repository that compliments gpgpu-sim, providing automated regression scripts, simulation launching utilities and the code + arguments …☆77Aug 22, 2020Updated 5 years ago
- ☆12Aug 4, 2018Updated 7 years ago