Review automated kernel generation in the era of LLMs
☆214 · May 14, 2026 · Updated last week
Alternatives and similar repositories for awesome-LLM-driven-kernel-generation
Users that are interested in awesome-LLM-driven-kernel-generation are comparing it to the libraries listed below.
- FlagTree is a unified compiler supporting multiple AI chip backends for custom Deep Learning operations, which is forked from triton-lang … ☆269 · Updated this week
- Building the Virtuous Cycle for AI-driven LLM Systems ☆229 · May 1, 2026 · Updated 3 weeks ago
- Repository for AI model benchmarking on TT-Buda ☆16 · Feb 9, 2026 · Updated 3 months ago
- ☆65 · Jul 14, 2025 · Updated 10 months ago
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854) ☆72 · May 13, 2026 · Updated 2 weeks ago
- Official implementation of TBA for async LLM post-training. ☆31 · Nov 5, 2025 · Updated 6 months ago
- Buda Compiler Backend for Tenstorrent devices ☆31 · Apr 2, 2025 · Updated last year
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures" ☆11 · Nov 21, 2022 · Updated 3 years ago
- ☆52 · Jun 14, 2024 · Updated last year
- ☆19 · Jun 3, 2023 · Updated 2 years ago
- FlagGems is an operator library for large language models implemented in the Triton Language. ☆1,001 · Updated this week
- Benchmarking PyTorch 2.0 different models ☆20 · Mar 19, 2023 · Updated 3 years ago
- FlagScale is a large model toolkit based on open-sourced projects. ☆514 · May 19, 2026 · Updated last week
- Implementation of the Reusable Enclaves paper ☆14 · Sep 25, 2023 · Updated 2 years ago
- Nankai University Operating Systems course labs (UCore) ☆11 · Oct 16, 2022 · Updated 3 years ago
- Getting Started with the Core Slicing Prototype ☆13 · Jun 2, 2023 · Updated 2 years ago
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning ☆26 · Feb 1, 2024 · Updated 2 years ago
- Autonomous GPU Kernel Generation & Optimization via Deep Agents ☆430 · May 19, 2026 · Updated last week
- ☆11 · Dec 4, 2023 · Updated 2 years ago
- A curated list of Security Big4 papers for Privacy, Mobile Security and Access Control. ☆14 · Oct 8, 2024 · Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”. ☆40 · Dec 31, 2024 · Updated last year
- qwen-nsa ☆87 · Oct 14, 2025 · Updated 7 months ago
- MB SeedKey Algos: tests sandbox, reverse engineering, etc ☆16 · Apr 7, 2025 · Updated last year
- A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library. ☆108 · Dec 17, 2025 · Updated 5 months ago
- ☆13 · May 12, 2025 · Updated last year
- Cavs: An Efficient Runtime System for Dynamic Neural Networks ☆15 · Sep 18, 2020 · Updated 5 years ago
- SqueezeNet Generator ☆31 · May 7, 2018 · Updated 8 years ago
- Generic build server