Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model
☆91 · Mar 2, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for K-Search
Users who are interested in K-Search are comparing it to the libraries listed below.
- Official Repo of CudaForge · ☆70 · Dec 2, 2025 · Updated 3 months ago
- Ship correct and fast LLM kernels to PyTorch · ☆147 · Jan 14, 2026 · Updated 2 months ago
- Advancing the frontier of efficient AI · ☆55 · Mar 18, 2026 · Updated last week
- ☆91 · Nov 22, 2025 · Updated 4 months ago
- Samples of good AI generated CUDA kernels · ☆102 · May 30, 2025 · Updated 9 months ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/ · ☆11 · Oct 26, 2025 · Updated 4 months ago
- ☆12 · Apr 9, 2025 · Updated 11 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs) · ☆869 · Mar 9, 2026 · Updated 2 weeks ago
- ☆32 · Jul 2, 2025 · Updated 8 months ago
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs · ☆10 · Aug 24, 2023 · Updated 2 years ago
- Utility that parses the stack sizes section from ELF objects and displays the preallocated stack size of each function. · ☆14 · Jan 15, 2020 · Updated 6 years ago
- ☆11 · Jan 19, 2025 · Updated last year
- Framework to reduce autotune overhead to zero for well known deployments. · ☆97 · Sep 19, 2025 · Updated 6 months ago
- ☆39 · Dec 14, 2025 · Updated 3 months ago
- ☆22 · Dec 25, 2025 · Updated 2 months ago
- This repo contains the benchmarks for Enzyme on GPUs · ☆11 · Feb 22, 2026 · Updated last month
- Benchmarking LLMs on Typst · ☆19 · May 26, 2025 · Updated 9 months ago
- An LLM-based AI agent that automatically writes correct and efficient GPU kernels. · ☆78 · Mar 18, 2026 · Updated last week
- Building the Virtuous Cycle for AI-driven LLM Systems · ☆204 · Updated this week
- Automatic differentiation for Triton Kernels · ☆29 · Aug 12, 2025 · Updated 7 months ago
- Some microbenchmarks and design docs before commencement · ☆11 · Feb 1, 2021 · Updated 5 years ago
- A basic repository for a Clang-based tool, with CMake integration. · ☆10 · Sep 22, 2023 · Updated 2 years ago
- The repo of "BugLens" · ☆39 · Nov 12, 2025 · Updated 4 months ago
- ☆41 · Jun 30, 2025 · Updated 8 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks · ☆15 · Sep 18, 2020 · Updated 5 years ago
- A simple SQL parser based on Apache Calcite. · ☆13 · Jan 17, 2026 · Updated 2 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) · ☆19 · Feb 11, 2025 · Updated last year
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. · ☆332 · Updated this week
- Datalog Engines OPtimization Tester. · ☆13 · Jan 18, 2024 · Updated 2 years ago
- ☆10 · Mar 28, 2024 · Updated last year
- Work related to vectorizing strategies for arbitrary FHE programs · ☆10 · Sep 5, 2025 · Updated 6 months ago
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark · ☆34 · Jun 24, 2025 · Updated 9 months ago
- ☆15 · Aug 16, 2024 · Updated last year
- https://xuruowei.com was created by her family, friends, and her partner Gao Ce in her memory. Xu Ruowei passed away on February 28, 2026. We hope this timeline commemorates her life: photos, stories, writing, music, and everything she loved. Walk along the path of her life and revisit those warm moments. · ☆27 · Mar 2, 2026 · Updated 3 weeks ago
- A size profiler for CUDA binaries · ☆71 · Jan 15, 2026 · Updated 2 months ago
- A toolkit for hybrid log parsing · ☆18 · Aug 23, 2023 · Updated 2 years ago
- ☆18 · Sep 27, 2022 · Updated 3 years ago
- A retargetable and extensible synthesis-based compiler for modern hardware architectures · ☆17 · Nov 20, 2025 · Updated 4 months ago
- ☆12 · Apr 30, 2024 · Updated last year