Automated High-Performance GPU Kernel Generation
☆114Jun 1, 2026Updated last week
Alternatives and similar repositories for K-Search
Users that are interested in K-Search are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repo of CudaForge☆83Dec 2, 2025Updated 6 months ago
- Ship correct and fast LLM kernels to PyTorch☆150Jan 14, 2026Updated 4 months ago
- FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.☆198Jun 7, 2026Updated last week
- Advancing the frontier of efficient AI☆66Jun 3, 2026Updated last week
- Framework to reduce autotune overhead to zero for well known deployments.☆101Sep 19, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆111May 3, 2026Updated last month
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)☆1,060Mar 24, 2026Updated 2 months ago
- Samples of good AI generated CUDA kernels☆105May 30, 2025Updated last year
- ☆97Nov 22, 2025Updated 6 months ago
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆216Apr 30, 2026Updated last month
- ☆13Apr 9, 2025Updated last year
- For building the world's largest dataset of GPU kernels.☆10Jun 5, 2026Updated last week
- ☆32Jul 2, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- Utility that parses stack sizes section from elf objects and displays the preallocated stack size of each function.☆14Jan 15, 2020Updated 6 years ago
- ☆40Dec 14, 2025Updated 6 months ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming☆11Nov 18, 2024Updated last year
- ☆28Apr 7, 2026Updated 2 months ago
- This repo contains the benchmarks for Enzyme on GPU's☆11May 28, 2026Updated 2 weeks ago
- code and data for Improving Temporal Link Prediction via Temporal Walk Matrix Projection, NeurIPS 2024☆15Oct 5, 2024Updated last year
- Benchmarking LLMs on Typst☆21May 26, 2025Updated last year
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A basic repository for a Clang-based tool, with CMake integration.☆10Sep 22, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Building the Virtuous Cycle for AI-driven LLM Systems☆243May 1, 2026Updated last month
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated last year
- NUMA-aware parallel packed CSR data structure for large-scale dynamic graph data☆14May 11, 2026Updated last month
- The repo of "BugLens"☆41Nov 12, 2025Updated 7 months ago
- ☆42May 19, 2026Updated 3 weeks ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- A pure-Python implementation of the Nvidia CuTe layout algebra intended to be approachable and easy to learn.☆184May 15, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generating Efficient AI-Centric Kernels☆104Updated this week
- This is the code for the paper published in IEEE Cloud Computing 2022☆12Jul 22, 2022Updated 3 years ago
- A lightweight tool for detecting bugs on Graph Database Management Systems☆15Jan 9, 2024Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- ☆168Dec 27, 2024Updated last year
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆358Updated this week
- Datalog Engines OPtimization Tester.☆13Jan 18, 2024Updated 2 years ago