gogongxt / nano-sglang
☆120 · Updated this week
Alternatives and similar repositories for nano-sglang
Users who are interested in nano-sglang are comparing it to the libraries listed below.
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model ☆64 · Oct 18, 2025 · Updated 3 months ago
- Cute layout visualization ☆30 · Jan 18, 2026 · Updated 3 weeks ago
- A simple calculation for LLM MFU. ☆67 · Sep 10, 2025 · Updated 5 months ago
- A Triton-only attention backend for vLLM ☆23 · Updated this week
- A lightweight Llama-like LLM inference framework based on Triton kernels. ☆171 · Jan 5, 2026 · Updated last month
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang ☆43 · Nov 19, 2025 · Updated 2 months ago
- ☆27 · Jan 7, 2025 · Updated last year
- Getting Started with Triton: A Tutorial for Python Beginners ☆35 · Oct 21, 2025 · Updated 3 months ago
- Operator library ☆17 · Jul 9, 2025 · Updated 7 months ago
- Large language model deployment based on the Ascend 310 chip ☆24 · Jun 14, 2024 · Updated last year
- ☆155 · Mar 4, 2025 · Updated 11 months ago
- Benchmark code for the "Online normalizer calculation for softmax" paper ☆105 · Jul 27, 2018 · Updated 7 years ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks ☆27 · Nov 18, 2024 · Updated last year