SGLang is a fast serving framework for large language models and vision language models.
☆30Mar 1, 2026Updated this week
Alternatives and similar repositories for sglang
Users who are interested in sglang are comparing it to the repositories listed below.
- A course for Mao Yisheng College of SWJTU☆11Mar 28, 2020Updated 5 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Feb 22, 2026Updated last week
- ☆11Dec 11, 2024Updated last year
- A helper package to get information of scholarly articles from DBLP using its public API☆15May 13, 2025Updated 9 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- ☆14Nov 5, 2025Updated 3 months ago
- ☆16Nov 26, 2024Updated last year
- yet another C++ 3d engine☆12Jan 24, 2020Updated 6 years ago
- Official Implementation of "Learning Harmonized Representations for Speculative Sampling" (HASS)☆54Mar 14, 2025Updated 11 months ago
- A platform for disclosing and displaying campus-recruitment salary data☆12Nov 26, 2016Updated 9 years ago
- Write yourself a simply-typed lambda calculus using Rust in a week!☆13May 13, 2024Updated last year
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆31Oct 22, 2025Updated 4 months ago
- SBoost is a SIMD-based C++ library enabling fast filtering and decoding of lightweight encoded data☆11Jul 6, 2021Updated 4 years ago
- Fast and memory-efficient exact attention☆18Updated this week
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 5 months ago
- TKDE-Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors☆14Aug 14, 2022Updated 3 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- ☆51May 31, 2024Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆15Aug 16, 2024Updated last year
- Game Engine From Scratch -- Rust China Conference 2020 topic by LemonHX and his team.☆14Dec 16, 2020Updated 5 years ago
- Open Neural Network Exchange to C compiler.☆15Apr 1, 2024Updated last year
- Neovim plugin for generating Java files (classes, interfaces, enums, records) with package-aware autocompletion.☆24Feb 7, 2026Updated 3 weeks ago
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- ☆18Mar 11, 2025Updated 11 months ago
- Temporal graph storage in rust☆13Apr 26, 2023Updated 2 years ago
- Final course project for the ZJU B/S (browser/server) architecture software design course☆13Jul 22, 2021Updated 4 years ago
- Optimizing data-intensive systems in disaggregated data centers☆13Jun 13, 2022Updated 3 years ago
- ☆16Nov 24, 2025Updated 3 months ago
- ☆35Updated this week
- ASR over WS and POST/GET with FAST_API. Can use many RU ASR models.☆18Jan 27, 2026Updated last month
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Dec 23, 2025Updated 2 months ago
- MMLU eval for RU/EN☆15Jul 31, 2023Updated 2 years ago
- Implementing a lightweight hypervisor on a RISC-V processor.☆12Dec 25, 2020Updated 5 years ago
- Generate a year-in-review summary of your browsing history!☆18Dec 12, 2024Updated last year
- A collection of tools for social networking, link prediction, and so on☆14Oct 16, 2020Updated 5 years ago
- Kokoro Language Model Training Script for Russian (Ruslan Corpus)☆37Updated this week
- Basedpyright extension for coc.nvim☆14Feb 2, 2026Updated last month
- Aligning Users across Social Networks Using Network Embedding (IJCAI). The paper's author uses Java; for wider application, we have updated the py…☆15May 25, 2022Updated 3 years ago
- What if everything is a io_uring?☆17Nov 10, 2022Updated 3 years ago