Official Repo of CudaForge
☆70Dec 2, 2025Updated 3 months ago
Alternatives and similar repositories for CudaForge
Users that are interested in CudaForge are comparing it to the libraries listed below
Sorting:
- Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model☆85Mar 2, 2026Updated 2 weeks ago
- ☆91Nov 22, 2025Updated 3 months ago
- Sample Codes using NVSHMEM on Multi-GPU☆30Jan 22, 2023Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/☆11Oct 26, 2025Updated 4 months ago
- Ship correct and fast LLM kernels to PyTorch☆145Jan 14, 2026Updated 2 months ago
- It is an LLM-based AI agent, which can write correct and efficient gpu kernels automatically.☆78Updated this week
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-perfo…☆94Feb 2, 2026Updated last month
- Autonomous GPU Kernel Generation & Optimization via Deep Agents☆309Mar 10, 2026Updated last week
- ☆125Updated this week
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆29Jan 13, 2026Updated 2 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning☆294Nov 3, 2025Updated 4 months ago
- Dynamic resources changes for multi-dimensional parallelism training☆30Aug 22, 2025Updated 6 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated 2 months ago
- Evolutionary algorithm discovery using Claude Code☆34Feb 11, 2026Updated last month
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆21Updated this week
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…☆10Feb 13, 2022Updated 4 years ago
- Code for DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access.☆11Apr 13, 2022Updated 3 years ago
- ☆10Nov 18, 2024Updated last year
- Website for CSE 234, Winter 2025☆13Mar 24, 2025Updated 11 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆23Jan 4, 2026Updated 2 months ago
- Samples of good AI generated CUDA kernels☆101May 30, 2025Updated 9 months ago
- Add specified programs (apps) on PATH.☆20Aug 26, 2025Updated 6 months ago
- General benchmarking apparatus for running multi-agent systems against benchmarks☆43Mar 12, 2026Updated last week
- Cute layout visualization☆33Jan 18, 2026Updated 2 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- ☆33Oct 13, 2025Updated 5 months ago
- string_view implementation for libc++. This should be a short-lived repo☆12Jun 11, 2014Updated 11 years ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- A custom Linux system for running Dividat Play.☆19Mar 12, 2026Updated last week
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆29Jan 22, 2026Updated last month
- Expert Specialization MoE Solution based on CUTLASS☆27Jan 19, 2026Updated 2 months ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- Docker images to build C++ projects. Includes cmake, conan, Qt and different compilers.☆11Jan 21, 2023Updated 3 years ago
- [CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction☆54Updated this week
- ☆18Mar 4, 2025Updated last year
- This repository consists of a library designed to make parsing command line arguments for c++ easy and efficient and a few simple program…☆11Mar 9, 2020Updated 6 years ago