LeetGPU Solutions
☆116Oct 9, 2025Updated 7 months ago
Alternatives and similar repositories for LeetGPU
Users that are interested in LeetGPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 26, 2025Updated last year
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆23Apr 2, 2025Updated last year
- ☆40Dec 14, 2025Updated 4 months ago
- SGLang kernel library for NPU☆128Updated this week
- Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend☆119Apr 30, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- ☆11Jun 11, 2023Updated 2 years ago
- ☆10Jun 10, 2023Updated 2 years ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆151May 10, 2025Updated 11 months ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated 2 weeks ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆31Dec 21, 2024Updated last year
- The GaussianSplatting Implementation based on LuisaCompute☆18Apr 11, 2026Updated 3 weeks ago
- TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆204May 1, 2026Updated last week
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆93Apr 14, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Cataloging released Triton kernels.☆302Sep 9, 2025Updated 8 months ago
- visual studio code extension for TDengine☆10Mar 21, 2023Updated 3 years ago
- Shared Middle-Layer for Triton Compilation☆331Dec 5, 2025Updated 5 months ago
- FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA.☆277Updated this week
- GPGPU-Sim 中文注释版代码,包含 GPGPU-Sim 模拟器的最新版代码,经过中文注释,以帮助中文用户更好地理解和使用该模拟器。☆27Dec 18, 2024Updated last year
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆41Apr 21, 2026Updated 2 weeks ago
- Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.☆13Jun 5, 2024Updated last year
- ☆19Jun 13, 2025Updated 10 months ago
- AP1400-2☆10Aug 5, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch library for cost-effective, fast and easy serving of MoE models.☆303Updated this week
- ☆97Mar 21, 2026Updated last month
- CVPR'24: MS-MANO: Enabling Hand Pose Tracking with Biomechanical Constraints☆16Jul 4, 2024Updated last year
- The Next-gen Language & Compiler Powering Efficient Hardware Design☆37Jan 16, 2025Updated last year
- A Rust library for creating solvers in the OP Stack's dispute protocol☆19Jan 15, 2024Updated 2 years ago
- A Triton-only attention backend for vLLM☆25Mar 17, 2026Updated last month
- FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.☆179Updated this week
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆27Nov 18, 2024Updated last year
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆181Feb 11, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Development repository for the Triton-Linalg conversion☆218Feb 7, 2025Updated last year
- Puzzles for learning Triton, play it with minimal environment configuration!☆691Mar 17, 2026Updated last month
- Implementation for Interactive Monte Carlo Denoising using Affinity of Neural Features☆10Dec 9, 2021Updated 4 years ago
- ☆45Nov 1, 2025Updated 6 months ago
- 🎲 Simple, compiler agnostic, C++23 reflection library (for aggregates and enums)☆19Aug 26, 2025Updated 8 months ago
- A Triton JIT runtime and ffi provider in C++☆33Apr 28, 2026Updated last week
- ☆13Apr 13, 2026Updated 3 weeks ago