SGLang is fast serving framework for large language models and vision language models.
☆35Nov 24, 2025Updated 6 months ago
Alternatives and similar repositories for worker-sglang
Users that are interested in worker-sglang are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️☆23Oct 8, 2023Updated 2 years ago
- Create embeddings with infinity as serverless endpoint☆46Nov 21, 2025Updated 6 months ago
- ☆14Dec 21, 2025Updated 5 months ago
- A list of awesome neural symbolic papers.☆53Jul 25, 2022Updated 3 years ago
- ☆98Mar 26, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- A server code for serving BERT-based models for text classification. It is designed by SerpApi for heavy-load prototyping and production …☆15Apr 17, 2024Updated 2 years ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆31Nov 21, 2025Updated 6 months ago
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB☆16Feb 21, 2026Updated 3 months ago
- Normalize Text in Russian☆29Jun 7, 2026Updated last week
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- 如何在美国加州建立501c3非盈利组织的文档☆15Sep 12, 2021Updated 4 years ago
- ☆36Mar 5, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Улучшенный морфологический анализатор для русского языка с DAWG-оптимизацией☆34Nov 8, 2025Updated 7 months ago
- Benchmarking the serving capabilities of vLLM☆58Aug 20, 2024Updated last year
- The repository is designed to help you build intent classification for user queries, and also generate tags for AI chat responses.☆12Mar 29, 2024Updated 2 years ago
- Retrieval-Augmented Generation with pgvector as vector database☆13Jan 23, 2024Updated 2 years ago
- ☆14Jul 28, 2024Updated last year
- Create Python APIs with AI Agents.☆11Aug 16, 2024Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 8 months ago
- A card reader powered by WebUSB.☆14Mar 31, 2019Updated 7 years ago
- Open Source multi-modal LLM environment. Host your own web and mobile chat interface, powered by real-time bots and voice AI functionalit…☆54Oct 14, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Resources for the Web MIDI API: Books, articles, software, demos etc...☆11Sep 14, 2017Updated 8 years ago
- ☆18Jun 21, 2024Updated last year
- ☆230May 26, 2026Updated 3 weeks ago
- 🔀 schedule functions on the main thread☆38Jun 7, 2026Updated last week
- Civitai Generator Node.js Client☆26Jul 1, 2024Updated last year
- always amend and --force push☆12Nov 28, 2017Updated 8 years ago
- Dashboard to monitor the performance of your Freqtrade instances☆13Apr 13, 2023Updated 3 years ago
- Frictionless Machine Learning on Kubernetes☆15Mar 7, 2023Updated 3 years ago
- Template for lead qualification voice AI assistant, based on the one used on my personal website☆15Apr 13, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- Tensara's GPU programming problems☆20Apr 23, 2026Updated last month
- Make tool-calling schemas for existing tools☆14Mar 8, 2025Updated last year
- Web UI for Bark by Suno.ai built with next.js☆12Jun 15, 2023Updated 3 years ago
- Learn how to create and deploy an ESP high availability system using Kafka as the message broker.☆11Feb 20, 2020Updated 6 years ago
- Data and graphs for repos and events from We Build SG☆16Aug 29, 2018Updated 7 years ago