sgl-project / sgl-project.github.ioLinks
This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang
☆100Updated this week
Alternatives and similar repositories for sgl-project.github.io
Users that are interested in sgl-project.github.io are comparing it to the libraries listed below
Sorting:
- Benchmark and optimize LLM inference across frameworks with ease☆161Updated 4 months ago
- ☆237Updated 2 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆287Updated this week
- Self-host LLMs with vLLM and BentoML☆168Updated 2 weeks ago
- ☆270Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆110Updated 8 months ago
- Utils for Unsloth https://github.com/unslothai/unsloth☆191Updated this week
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆259Updated last month
- vLLM adapter for a TGIS-compatible gRPC server.☆50Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- LLM inference in C/C++☆104Updated last week
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆642Updated last week
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes…☆366Updated this week
- Codebase for FinePDFs☆174Updated last month
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated 2 months ago
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, datasets, and full end-to-end refere…☆392Updated this week
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 9 months ago
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆201Updated last month
- ☆220Updated 3 months ago
- Inference server benchmarking tool☆142Updated 4 months ago
- Tutorial for building LLM router☆244Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆224Updated 5 months ago
- ☆76Updated 7 months ago
- Route LLM requests to the best model for the task at hand.☆177Updated 3 weeks ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 5 months ago
- Pivotal Token Search☆144Updated last month
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 9 months ago