SGLang is fast serving framework for large language models and vision language models.
☆34Nov 24, 2025Updated 4 months ago
Alternatives and similar repositories for worker-sglang
Users that are interested in worker-sglang are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆422Updated this week
- ☆12Feb 24, 2025Updated last year
- ⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️☆23Oct 8, 2023Updated 2 years ago
- Allows AI Agents to interact with the Twilio SendGrid v3 API, managing contact lists, templates, single sends, and stats☆26Feb 25, 2025Updated last year
- Create embeddings with infinity as serverless endpoint☆43Nov 21, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A list of awesome neural symbolic papers.☆52Jul 25, 2022Updated 3 years ago
- Daily.co + Pipecat + Tavus AI Avatar Agent☆15Apr 19, 2025Updated 11 months ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆15Apr 17, 2024Updated last year
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB☆16Feb 21, 2026Updated last month
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- 如何在美国加州建立501c3非盈利组织的文档☆15Sep 12, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆19May 4, 2023Updated 2 years ago
- Benchmarking the serving capabilities of vLLM☆59Aug 20, 2024Updated last year
- Clarify your words with emojis☆12Aug 25, 2016Updated 9 years ago
- Golang SDK for Truss☆40Updated this week
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- The repository is designed to help you build intent classification for user queries, and also generate tags for AI chat responses.☆12Mar 29, 2024Updated 2 years ago
- Self-host LLMs with LMDeploy and BentoML☆22Dec 26, 2025Updated 3 months ago
- Allows testing of Server Sent Events☆22Jan 7, 2023Updated 3 years ago
- Retrieval-Augmented Generation with pgvector as vector database☆13Jan 23, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 28, 2024Updated last year
- Create Python APIs with AI Agents.☆11Aug 16, 2024Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 6 months ago
- A card reader powered by WebUSB.☆14Mar 31, 2019Updated 7 years ago
- Resources for the Web MIDI API: Books, articles, software, demos etc...☆11Sep 14, 2017Updated 8 years ago
- Declarative AI Pipelines☆22Oct 2, 2024Updated last year
- ☆17Jun 21, 2024Updated last year
- VS Code inspired text editor that mostly runs in a webworker☆11Updated this week
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Machinery data, made easy. Easily download and prepare common industrial datasets.☆23Feb 13, 2024Updated 2 years ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd☆26Jul 5, 2023Updated 2 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- Template for lead qualification voice AI assistant, based on the one used on my personal website☆15Apr 13, 2025Updated last year
- Open Pixel Control protocol☆15May 6, 2018Updated 7 years ago
- HTM Learning Algorithm Implementation for learning and generating musical sequences☆10Apr 14, 2015Updated 11 years ago
- ☆13Jan 17, 2024Updated 2 years ago