zhang677/AccelOpt

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhang677/AccelOpt)

zhang677 / AccelOpt

[MLSys 2026] AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization

☆57

Alternatives and similar repositories for AccelOpt

Users that are interested in AccelOpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhang677 / PCL-lite
View on GitHub
[ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development
☆17Jan 6, 2026Updated 6 months ago
GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
flashinfer-ai / flashinfer-bench
View on GitHub
Building the Virtuous Cycle for AI-driven LLM Systems
☆261May 1, 2026Updated 2 months ago
aws-neuron / nki-library
View on GitHub
☆68Jul 14, 2026Updated last week
uw-syfi / vibesys
View on GitHub
Can AI Agents Build Bespoke Systems?
☆85Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
nex-agi / NexVenusCL
View on GitHub
Nex Venus Communication Library
☆75Nov 17, 2025Updated 8 months ago
meta-pytorch / KernelAgent
View on GitHub
Autonomous GPU Kernel Generation & Optimization via Deep Agents
☆490Jul 15, 2026Updated last week
ScalingIntelligence / KernelBench
View on GitHub
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
☆1,156Mar 24, 2026Updated 4 months ago
amd / Triton-XDNA
View on GitHub
☆46Jul 14, 2026Updated last week
flashinfer-ai / cubloaty
View on GitHub
a size profiler for cuda binary
☆71Jan 15, 2026Updated 6 months ago
comaniac / epoi
View on GitHub
Benchmark PyTorch Custom Operators
☆14Jul 6, 2023Updated 3 years ago
ChijinZ / PolyJuice-Fuzzer
View on GitHub
A DL compiler fuzzer
☆15Nov 1, 2024Updated last year
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Jul 17, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
inclusionAI / humming
View on GitHub
☆165Updated this week
Deep-Learning-Profiling-Tools / fasten
View on GitHub
☆14Apr 24, 2024Updated 2 years ago
ademeure / DeeperGEMM
View on GitHub
DeeperGEMM: crazy optimized version
☆86May 5, 2025Updated last year
uwsampl / paper-agents
View on GitHub
☆13Dec 9, 2024Updated last year
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
Infini-AI-Lab / Sparrow
View on GitHub
☆16Jun 15, 2026Updated last month
xinhao-luo / ClusterFusion
View on GitHub
[NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive
☆75Dec 11, 2025Updated 7 months ago
AMD-AGI / GEAK
View on GitHub
Generating Efficient AI-Centric Kernels
☆130Updated this week
UCLA-VAST / heterohalide
View on GitHub
HeteroHalide: From Image Processing DSL to Efficient FPGA Acceleration
☆15Sep 14, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
flashinfer-ai / mlsys26-agent-baseline
View on GitHub
☆33Mar 12, 2026Updated 4 months ago
Inference-and-Optimization / High-Level-Synthesis-Study-Notes
View on GitHub
Vivado HLS study notes, courses, documents.
☆12Dec 7, 2019Updated 6 years ago
samkaufman / morello
View on GitHub
☆17Jul 7, 2026Updated 2 weeks ago
vllm-project / vllm-nccl
View on GitHub
Manages vllm-nccl dependency
☆18Jun 3, 2024Updated 2 years ago
aws-neuron / nki-llama
View on GitHub
Project showing how to develop NKI kernels for Llama 3.2 1B inference
☆21May 29, 2025Updated last year
aws-neuron / nki-moe
View on GitHub
MLSys competition for the best MOE NKI kernels
☆48May 29, 2026Updated last month
stanford-cs149 / asst4-trainium2
View on GitHub
☆19Nov 21, 2025Updated 8 months ago
kotoba-tech / Open-GPT-4o
View on GitHub
☆10May 16, 2024Updated 2 years ago
geraudnt / boolean_composition
View on GitHub
Code for the paper "A Boolean Task Algebra For Reinforcement Learning"
☆11Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thu-nics / nicsefc-readme
View on GitHub
some docs for rookies in nics-efc
☆22Mar 17, 2022Updated 4 years ago
SakanaAI / robust-kbench
View on GitHub
☆101Nov 22, 2025Updated 8 months ago
NVIDIA / SOL-ExecBench
View on GitHub
A benchmark of real-world DL kernel problems
☆263Jul 15, 2026Updated last week
mit-han-lab / SMEPO
View on GitHub
☆16May 27, 2026Updated last month
wzzll123 / MultiKernelBench
View on GitHub
MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation
☆64Jul 8, 2026Updated 2 weeks ago
Infini-AI-Lab / astraflow
View on GitHub
Dataflow-Oriented Reinforcement Learning for (Multi-)Agentic LLMs
☆96Updated this week
aws-samples / end-2-end-3d-ml
View on GitHub
This repository features Amazon SageMaker Ground Truth and explains how to ingest raw 3D point cloud data, label it, train a 3D object de…
☆13Jun 23, 2022Updated 4 years ago