antgroup/sglang

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/antgroup/sglang)

antgroup / sglang

SGLang is a fast serving framework for large language models and vision language models.

☆30

Alternatives and similar repositories for sglang

Users that are interested in sglang are comparing it to the libraries listed below

Sorting:

swjtu-maker / codes2things_0
View on GitHub
A course for Mao Yisheng College of SWJTU
☆11Mar 28, 2020Updated 5 years ago
MoFHeka / execution-ucx
View on GitHub
A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.
☆29Feb 22, 2026Updated last week
Elluran / concentration_notebooks
View on GitHub
☆11Dec 11, 2024Updated last year
alumik / dblp-api
View on GitHub
A helper package to get information of scholarly articles from DBLP using its public API
☆15May 13, 2025Updated 9 months ago
AniZpZ / smoothquant
View on GitHub
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆11Dec 13, 2023Updated 2 years ago
reiase / probing
View on GitHub
☆14Nov 5, 2025Updated 3 months ago
bethelmelesse / UnifiedCrawl
View on GitHub
☆16Nov 26, 2024Updated last year
willow385 / djf-3d-2
View on GitHub
yet another C++ 3d engine
☆12Jan 24, 2020Updated 6 years ago
HArmonizedSS / HASS
View on GitHub
Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)
☆54Mar 14, 2025Updated 11 months ago
2pac-ZPaC / offershow
View on GitHub
校招薪水的一个数据爆料和展示平台
☆12Nov 26, 2016Updated 9 years ago
xzhseh / stlc-in-a-week
View on GitHub
Write yourself a simply-typed lambda calculus using Rust in a week!
☆13May 13, 2024Updated last year
NVIDIA / hoti-2025-gpu-comms-tutorial
View on GitHub
Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025
☆31Oct 22, 2025Updated 4 months ago
UCHI-DB / sboost
View on GitHub
SBoost is a SIMD-based C++ library enabling fast filtering and decoding of lightweight encoded data
☆11Jul 6, 2021Updated 4 years ago
sgl-project / sgl-flash-attn
View on GitHub
Fast and memory-efficient exact attention
☆18Updated this week
vakovalskii / cursor_agent_flow
View on GitHub
cursor logs with gpt-4o using litellm proxy
☆14Sep 9, 2025Updated 5 months ago
yanzihan1 / PSML
View on GitHub
TKDE-Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors
☆14Aug 14, 2022Updated 3 years ago
hyhuang00 / moe_inference
View on GitHub
Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".
☆19Oct 30, 2024Updated last year
nyunAI / PruneGPT
View on GitHub
☆51May 31, 2024Updated last year
IST-DASLab / peft-rosa
View on GitHub
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
☆15Aug 16, 2024Updated last year
rustcc / GEFS
View on GitHub
Game Engine From Scratch -- Rust China Conference 2020 topic by LemonHX and his team.
☆14Dec 16, 2020Updated 5 years ago
AUTOMATIC1111 / onnx2c
View on GitHub
Open Neural Network Exchange to C compiler.
☆15Apr 1, 2024Updated last year
alessio-vivaldelli / java-creator-nvim
View on GitHub
Neovim plugin for generating Java files (classes, interfaces, enums, records) with package-aware autocompletion.
☆24Feb 7, 2026Updated 3 weeks ago
AlibabaPAI / FlashModels
View on GitHub
Fast and easy distributed model training examples.
☆12Nov 26, 2024Updated last year
pzs19 / TokenSelect
View on GitHub
☆18Mar 11, 2025Updated 11 months ago
Raphtory / docbrown
View on GitHub
Temporal graph storage in rust
☆13Apr 26, 2023Updated 2 years ago
73-SEVENTYTHREE / B-S-
View on GitHub
ZJU B/S体系软件设计课程大作业
☆13Jul 22, 2021Updated 4 years ago
eniac / TELEPORT
View on GitHub
Optimizing data-intensive systems in disaggregated data centers
☆13Jun 13, 2022Updated 3 years ago
Snowflake-Labs / vllm
View on GitHub
☆16Nov 24, 2025Updated 3 months ago
ModelEngine-Group / flexai
View on GitHub
☆35Updated this week
Sanich137 / ASR_FastAPI_WS_RU
View on GitHub
ASR on WS, POST/GET FAST_API Can use many RU asr models.
☆18Jan 27, 2026Updated last month
DeepAuto-AI / sglang
View on GitHub
This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.
☆18Dec 23, 2025Updated 2 months ago
NLP-Core-Team / mmlu_ru
View on GitHub
MMLU eval for RU/EN
☆15Jul 31, 2023Updated 2 years ago
oscomp / proj23-lightweight-hypervisor
View on GitHub
在RISC-V处理器上实现一个轻量级的Hypervisor。
☆12Dec 25, 2020Updated 5 years ago
Qi-Zhan / BrowsingYearReview
View on GitHub
快来生成你的浏览记录年度总结!
☆18Dec 12, 2024Updated last year
yanzihan1 / Use-Dynamic-network-embedding-for-Social-Network-Aligment-
View on GitHub
in this part, I will provide many tools for social networking, Link prediction and so on
☆14Oct 16, 2020Updated 5 years ago
igorshmukler / kokoro-ruslan
View on GitHub
Kokoro Language Model Training Script for Russian (Ruslan Corpus)
☆37Updated this week
fannheyward / coc-basedpyright
View on GitHub
Basedpyright extension for coc.nvim
☆14Feb 2, 2026Updated last month
yanzihan1 / IONE-Aligning-Users-across-Social-Networks-Using-Network-Embedding
View on GitHub
Aligning Users across Social Networks Using Network Embedding(IJCAI)，paper author uses Java,For wider application, we have updated the py…
☆15May 25, 2022Updated 3 years ago
CircuitCoder / ChannelOS
View on GitHub
What if everything is a io_uring?
☆17Nov 10, 2022Updated 3 years ago