SGLang is a fast serving framework for large language models and vision language models.
☆33 · Nov 24, 2025 · Updated 3 months ago
Alternatives and similar repositories for worker-sglang
Users interested in worker-sglang are comparing it to the libraries listed below.
- ⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️ ☆23 · Oct 8, 2023 · Updated 2 years ago
- Normalize Text in Russian ☆28 · Nov 7, 2023 · Updated 2 years ago
- Frictionless Machine Learning on Kubernetes ☆15 · Mar 7, 2023 · Updated 2 years ago
- InSales e-commerce platform API bindings ☆14 · Jul 13, 2024 · Updated last year
- The repository is designed to help you build intent classification for user queries, and also generate tags for AI chat responses. ☆12 · Mar 29, 2024 · Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Sep 19, 2025 · Updated 5 months ago
- 🔀 schedule functions on the main thread ☆37 · Mar 10, 2022 · Updated 3 years ago
- Source code obfuscator for Golang ☆34 · Oct 5, 2024 · Updated last year
- 🔐 go-krypto: A Go library collecting cryptographic algorithms designed in the Republic of Korea (SEED, ARIA, HIGHT, LEA, HAS160, LSH, KC… ☆35 · Sep 10, 2024 · Updated last year
- ☆29 · Dec 20, 2025 · Updated 2 months ago
- Make Git monorepos like a boss. ☆13 · Dec 18, 2025 · Updated 2 months ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling" ☆19 · Nov 3, 2025 · Updated 4 months ago
- better launcher of ollama ☆17 · Feb 23, 2026 · Updated last week
- A local, voice-controlled AI assistant with the personality of HAL 9000 from 2001: A Space Odyssey. ☆22 · Aug 16, 2025 · Updated 6 months ago
- 235,886 Words for Go ☆12 · Nov 16, 2018 · Updated 7 years ago
- VS Code inspired text editor that mostly runs in a webworker ☆11 · Updated this week
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects. ☆22 · Sep 20, 2025 · Updated 5 months ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3 ☆12 · Nov 3, 2023 · Updated 2 years ago
- Using a GAN to synthetically generate medical images for DL purposes ☆11 · Jun 28, 2023 · Updated 2 years ago
- Chrome/Firefox extension to help you write gud anywhere there's a text field in your browser via an LLM ☆10 · Feb 2, 2025 · Updated last year
- Learn how to create and deploy an ESP high availability system using Kafka as the message broker. ☆10 · Feb 20, 2020 · Updated 6 years ago
- Ampersand CLI ☆17 · Feb 18, 2026 · Updated last week
- ☆10 · Feb 19, 2022 · Updated 4 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆47 · Mar 20, 2025 · Updated 11 months ago
- A guide to structured generation using constrained decoding ☆14 · Jun 9, 2024 · Updated last year
- Problem statements and code from Google Foobar ☆10 · Nov 10, 2020 · Updated 5 years ago
- ☆13 · Oct 1, 2024 · Updated last year
- Concurrent hash tries for C++ 14 with no memory management whatsoever. ☆10 · Aug 30, 2016 · Updated 9 years ago
- Piper based VoiceDock TTS implementation ☆11 · Aug 12, 2023 · Updated 2 years ago
- A drag-and-drop-enabled, responsive, envelope graph that allows to shape a wave with attack, decay, sustain and release ☆11 · Jan 5, 2023 · Updated 3 years ago
- [RFC9380] Hash to curves - Go reference implementation ☆21 · Nov 20, 2025 · Updated 3 months ago
- Code showing how to use a model based on the ML model base class. ☆10 · Sep 30, 2022 · Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions ☆10 · Jan 17, 2021 · Updated 5 years ago
- 🎵 muse: Music Separation ☆11 · Feb 14, 2024 · Updated 2 years ago
- WebRTC Group Call is a simple video chat application for multi-users based on React, Node Express and WebRTC. ☆11 · Feb 9, 2025 · Updated last year
- a distributed key-value store written in python ☆13 · Oct 12, 2020 · Updated 5 years ago
- Memory experiments with LLMs ☆11 · Mar 31, 2023 · Updated 2 years ago
- A Go implementation of Rust's evmap which optimizes for high-read, low-write workloads and uses eventual consistency to ensure that reade… ☆10 · Aug 21, 2022 · Updated 3 years ago
- ☆11 · Aug 11, 2016 · Updated 9 years ago