sgl-project / sgl-project.github.io

This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.

☆32

Alternatives and similar repositories for sgl-project.github.io:

Users that are interested in sgl-project.github.io are comparing it to the libraries listed below

bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆97Updated last week
weaviate / mcp-server-weaviate
MCP (Model Context Protocol) server for Weaviate
☆58Updated 3 weeks ago
lightblue-tech / lb-reranker
☆22Updated 2 months ago
langfuse / oss-llmops-stack
Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…
☆90Updated last month
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆59Updated 7 months ago
moxin-org / Moxin-LLM
Moxin is a family of fully open-source and reproducible LLMs
☆85Updated 2 weeks ago
LLMSELECTOR / LLMSELECTOR
☆61Updated last month
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 2 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆19Updated 3 months ago
nospecs / ovadare
AI conflict resolution framework designed to work alongside existing AI orchestration tools
☆23Updated 3 months ago
substratusai / vllm-docker
☆56Updated last week
unslothai / llama.cpp
LLM inference in C/C++
☆67Updated last week
bentoml / rag-tutorials
a series of tutorials implementing rag service with BentoML and LlamaIndex
☆36Updated 3 months ago
catena-labs / moa-llm
A Python library to orchestrate LLMs in a neural network-inspired structure
☆46Updated 5 months ago
quantalogic / qllm
QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…
☆33Updated last month
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆52Updated 4 months ago
dottxt-ai / prompts
A prompting library
☆156Updated 6 months ago
cognitivecomputations / kraken
☆66Updated 10 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆87Updated 3 months ago
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆65Updated 8 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆67Updated 5 months ago
unslothai / unsloth-studio
Unsloth Studio
☆74Updated 3 weeks ago
cognitivecomputations / SystemChat
☆30Updated 8 months ago
simonw / llm-command-r
Access the Cohere Command R family of models
☆35Updated last week
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆145Updated 2 months ago
ibm-granite / granite-3.0-language-models
☆255Updated 3 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆151Updated this week
sony / talkhier
Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"
☆47Updated last month
run-llama / mixtral_ollama
☆46Updated last year
wandb / programmer
☆54Updated 3 months ago