sgl-project/sgl-cookbook

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sgl-project/sgl-cookbook)

sgl-project / sgl-cookbook

Cookbook of SGLang - Recipe

☆93

Alternatives and similar repositories for sgl-cookbook

Users that are interested in sgl-cookbook are comparing it to the libraries listed below

Sorting:

HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated last month
Yifei-Zuo / Flash-LLA
View on GitHub
Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…
☆23Oct 1, 2025Updated 5 months ago
thib-s / flash-newton-schulz
View on GitHub
My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.
☆32Dec 5, 2025Updated 3 months ago
google-deepmind / agent_debugger
View on GitHub
Causal Analysis of Agent Behavior for AI Safety
☆20Jun 27, 2023Updated 2 years ago
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
microsoft / CollabLLM
View on GitHub
☆32Aug 7, 2025Updated 7 months ago
lemyx / tilelang-dsa
View on GitHub
DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang
☆44Nov 19, 2025Updated 3 months ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆28Sep 4, 2025Updated 6 months ago
Fridge003 / Cuda-Learn-By-Practice
View on GitHub
Codebase for Cuda Learning
☆31Jul 13, 2024Updated last year
MLSys-Learner-Resources / Awesome-MLSys-Blogger
View on GitHub
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆327Jan 5, 2025Updated last year
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆35Jul 29, 2025Updated 7 months ago
tile-ai / tilescale
View on GitHub
Tile-based language built for AI computation across all scales
☆138Feb 27, 2026Updated last week
ademeure / DeeperGEMM
View on GitHub
DeeperGEMM: crazy optimized version
☆74May 5, 2025Updated 10 months ago
rasbt / comparing-automatic-augmentation-blog
View on GitHub
Comparing four automatic image augmentation techniques in PyTorch: AutoAugment, RandAugment, AugMix, and TrivialAugment
☆31Feb 7, 2023Updated 3 years ago
intel / video-streamer
View on GitHub
The repository contains a reference end-to-end pipeline for a real-time video analytics application. Realtime data is provided to an infe…
☆12Nov 3, 2025Updated 4 months ago
google-research / precondition
View on GitHub
☆31Updated this week
hnam-1765 / WriteViT
View on GitHub
☆18Sep 23, 2025Updated 5 months ago
WaveSpeedAI / QuantumAttention
View on GitHub
[WIP] Better (FP8) attention for Hopper
☆32Feb 24, 2025Updated last year
LAION-AI / Conditional-Pretraining-of-Large-Language-Models
View on GitHub
☆37May 7, 2023Updated 2 years ago
marcow1601 / Plane-Stabilizer-and-Autopilot
View on GitHub
Automatic stabilizing and auto-piloting system for RC flying wing
☆14Mar 3, 2016Updated 10 years ago
reinout / reinout.vanrees.org
View on GitHub
Software that runs reinout.vanrees.org
☆20Updated this week
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Feb 20, 2026Updated 2 weeks ago
Tencent-Hunyuan / flex-block-attn
View on GitHub
flex-block-attn: an efficient block sparse attention computation library
☆124Dec 26, 2025Updated 2 months ago
ByteDance-Seed / Triton-distributed
View on GitHub
Distributed Compiler based on Triton for Parallel Systems
☆1,380Feb 13, 2026Updated 3 weeks ago
AlibabaPAI / FLASHNN
View on GitHub
☆104Sep 9, 2024Updated last year
infinigence / FlashOverlap
View on GitHub
A lightweight design for computation-communication overlap.
☆223Jan 20, 2026Updated last month
alenic / timm-models-explorer
View on GitHub
Timm model explorer
☆42Apr 12, 2024Updated last year
AnesBenmerzoug / langsfer
View on GitHub
A library for language transfer methods and algorithms.
☆16Feb 6, 2026Updated last month
jacobmarks / huggingface-fiftyone-converters
View on GitHub
Convert datasets from Hugging Face to FiftyOne for Visualization
☆11Mar 15, 2024Updated last year
OpenVPN / openvpn-rfc
View on GitHub
Work in progress to create an RFC that documents the OpenVPN protocol
☆14Nov 24, 2025Updated 3 months ago
josuttis / cppmodules
View on GitHub
☆10May 23, 2023Updated 2 years ago
jargonLint / jargonLint
View on GitHub
jargonLint: A project to build and maintain comprehensive Vale rules.
☆16Feb 2, 2026Updated last month
Yang011013 / Awesome-Streaming-Video-Understanding
View on GitHub
Awesome latest models, datasets and benchmarks on streaming/online video understanding.
☆23Oct 19, 2025Updated 4 months ago
vdesai2014 / diffusion-policy-accelerated
View on GitHub
☆12Mar 7, 2024Updated 2 years ago
Zolewit / TNNdemo
View on GitHub
很好用的tnn classify demo
☆11Mar 24, 2021Updated 4 years ago
gaocegege / xuruowei-forever
View on GitHub
https://xuruowei.com 是她的家人朋友们和她的爱人高策为纪念她留下的。徐若薇于 2026 年 2 月 28 日离世。我们希望通过这个时间线纪念她的一生——照片、故事、文字、音乐与她钟爱的一切。沿着她生命的轨迹漫步，重新触摸那些有温度的瞬间。
☆27Mar 2, 2026Updated last week
shyhoom / T22_034_CRDDC_2022_SourceCode
View on GitHub
T22_034_han_shi_hao_CRDDC_2022_SourceCode
☆11Dec 29, 2023Updated 2 years ago
Deep-Learning-Profiling-Tools / triton-samples
View on GitHub
☆14Mar 8, 2025Updated last year
suzukimain / auto_diffusers
View on GitHub
diffusers with search engine
☆12Jan 13, 2026Updated last month