SGLang is a fast serving framework for large language models and vision language models.
☆34 · Nov 24, 2025 · Updated 6 months ago
Alternatives and similar repositories for worker-sglang
Users that are interested in worker-sglang are comparing it to the libraries listed below.
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆444 · Updated this week
- ⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️ ☆23 · Oct 8, 2023 · Updated 2 years ago
- Create embeddings with infinity as serverless endpoint ☆45 · Nov 21, 2025 · Updated 6 months ago
- ☆14 · Dec 21, 2025 · Updated 5 months ago
- A list of awesome neural symbolic papers. ☆53 · Jul 25, 2022 · Updated 3 years ago
- Unofficial GitLab Android client. Support self hosted GitLab and Push notifications ☆10 · May 18, 2016 · Updated 10 years ago
- Daily.co + Pipecat + Tavus AI Avatar Agent ☆16 · Apr 19, 2025 · Updated last year
- Interface for interacting with Gradient AI in Python ☆15 · Jun 28, 2024 · Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included ☆17 · Oct 2, 2024 · Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production … ☆15 · Apr 17, 2024 · Updated 2 years ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab… ☆30 · Nov 21, 2025 · Updated 6 months ago
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB ☆16 · Feb 21, 2026 · Updated 3 months ago
- Normalize Text in Russian ☆29 · Nov 7, 2023 · Updated 2 years ago
- Documentation on how to establish a 501(c)(3) nonprofit organization in California, USA ☆15 · Sep 12, 2021 · Updated 4 years ago
- Benchmarking the serving capabilities of vLLM ☆58 · Aug 20, 2024 · Updated last year
- Clarify your words with emojis ☆12 · Aug 25, 2016 · Updated 9 years ago
- Golang SDK for Truss ☆40 · Apr 8, 2026 · Updated last month
- Hugging Face RoBERTa with Flash Attention 2 ☆24 · Sep 14, 2025 · Updated 8 months ago
- The repository is designed to help you build intent classification for user queries, and also generate tags for AI chat responses. ☆12 · Mar 29, 2024 · Updated 2 years ago
- Self-host LLMs with LMDeploy and BentoML ☆22 · Dec 26, 2025 · Updated 5 months ago
- Retrieval-Augmented Generation with pgvector as vector database ☆13 · Jan 23, 2024 · Updated 2 years ago
- ☆20 · Oct 5, 2025 · Updated 7 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Sep 19, 2025 · Updated 8 months ago
- Moondream MCP Server in Python ☆47 · Jul 2, 2025 · Updated 10 months ago
- Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionalit… ☆53 · Oct 14, 2025 · Updated 7 months ago
- THINK LESS, SCREAM MORE! ☆11 · Feb 17, 2016 · Updated 10 years ago
- ☆18 · Jun 21, 2024 · Updated last year
- 🔀 schedule functions on the main thread ☆37 · Mar 10, 2022 · Updated 4 years ago
- Machinery data, made easy. Easily download and prepare common industrial datasets. ☆23 · Feb 13, 2024 · Updated 2 years ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd ☆26 · Jul 5, 2023 · Updated 2 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial) ☆11 · Jul 19, 2024 · Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆47 · Mar 20, 2025 · Updated last year
- ☆13 · Jan 17, 2024 · Updated 2 years ago
- Make tool-calling schemas for existing tools ☆14 · Mar 8, 2025 · Updated last year
- Universal connector to LLMs for Node.js & Bun ☆30 · May 19, 2026 · Updated last week
- ☆24 · May 18, 2026 · Updated last week
- Learn how to create and deploy an ESP high availability system using Kafka as the message broker. ☆11 · Feb 20, 2020 · Updated 6 years ago
- Data and graphs for repos and events from We Build SG ☆16 · Aug 29, 2018 · Updated 7 years ago
- Web VJing for everyone. ☆11 · May 26, 2016 · Updated 10 years ago