Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
☆836Mar 19, 2026Updated last week
Alternatives and similar repositories for autokernel
Users who are interested in autokernel are comparing it to the libraries listed below.
- LLM4Kernel: A Survey of Large Language Models for GPU Kernel Development☆56Mar 19, 2026Updated last week
- Triton kernels for Flux☆22Jul 7, 2025Updated 8 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels☆153Updated this week
- ☆18Dec 8, 2024Updated last year
- A Flask app that helps me write blog posts.☆12Mar 15, 2025Updated last year
- ☆43Mar 3, 2026Updated 3 weeks ago
- A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-perfo…☆96Feb 2, 2026Updated last month
- ☆36Jan 25, 2026Updated 2 months ago
- Making code editing up to 7.7x faster using multi-layer speculation☆24Feb 20, 2025Updated last year
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆25Aug 27, 2025Updated 7 months ago
- A PyTorch-native inference engine with hybrid cache acceleration and massive parallelism for DiTs.☆1,102Mar 20, 2026Updated last week
- Weaving prompts and code into structured, resilient patterns that won't unravel under pressure.☆30Dec 6, 2025Updated 3 months ago
- NVIDIA Nemo Parakeet TDT 0.6B V2 Audio to Text Python Script☆21May 8, 2025Updated 10 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- Paging Debug tool for GDB using python☆13Jun 4, 2022Updated 3 years ago
- Shor's algorithm simulation using CUDA☆19Nov 10, 2019Updated 6 years ago
- real-time speech enhance skip-dpcrn-base using C++☆25Nov 12, 2022Updated 3 years ago
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆46Updated this week
- ☆43Jan 30, 2026Updated last month
- LLM evals to test open-source inference providers☆56Mar 12, 2026Updated 2 weeks ago
- PyTorch implementation of the Flash Spectral Transform Unit.☆22Sep 19, 2024Updated last year
- Code snippets and reproductions from JustAByte☆25Jan 25, 2026Updated 2 months ago
- 👷 Build compute kernels☆216Jan 27, 2026Updated 2 months ago
- Audio-FLAN☆160Sep 23, 2025Updated 6 months ago
- PTX-EMU is a simple emulator for CUDA programs.☆38Apr 25, 2025Updated 11 months ago
- An OpenAI API compatible FastAPI server that sits on top of the Anemll repo. Tested with Open WebUI.☆20Jan 21, 2026Updated 2 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- Code2Worlds: Empowering Coding LLMs for 4D World Generation☆92Feb 26, 2026Updated last month
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last month
- CLI that uses DSPy to interact with MCP servers.☆24Mar 10, 2025Updated last year
- ☆39Dec 18, 2025Updated 3 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)☆884Updated this week
- EgoVerse: Egocentric Data for Robot Learning from Around the World☆174Updated this week
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆88Feb 3, 2026Updated last month
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆171Mar 9, 2026Updated 2 weeks ago