SGLang is a fast serving framework for large language models and vision language models.
☆21 · May 22, 2025 · Updated last year
Alternatives and similar repositories for sglang
Users who are interested in sglang are comparing it to the libraries listed below.
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆67 · Oct 25, 2024 · Updated last year
- ☆11 · May 17, 2024 · Updated 2 years ago
- ☆13 · Oct 2, 2024 · Updated last year
- obs-studio plugin to simulate a directshow webcam ☆10 · Aug 19, 2022 · Updated 3 years ago
- ☆12 · Nov 8, 2023 · Updated 2 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020 ☆14 · Oct 6, 2020 · Updated 5 years ago
- llmware RAG Demo App. ☆17 · Dec 10, 2023 · Updated 2 years ago
- Collection of examples and how-tos using experimental API for Midjourney, Pika and InsightFaceSwap Discord bots. ☆18 · Aug 16, 2025 · Updated 9 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Jan 19, 2024 · Updated 2 years ago
- A RAG that can scale 🧑🏻💻 ☆11 · May 28, 2024 · Updated 2 years ago
- Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming ☆11 · Jun 25, 2020 · Updated 5 years ago
- Our Graduation project for FCIS mansoura university - CS depart ☆14 · Nov 27, 2022 · Updated 3 years ago
- Little directed graph with backlink support. ☆11 · Nov 19, 2015 · Updated 10 years ago
- Learn How to Use the Google Slides API ☆23 · Nov 18, 2025 · Updated 6 months ago
- Basicss OOCSS framework ☆10 · Nov 26, 2018 · Updated 7 years ago
- Faster Whisper with additional features ☆49 · Mar 10, 2025 · Updated last year
- League of Legends v4.20 RL Environment (LoLRLE) ☆22 · Feb 23, 2025 · Updated last year
- A simple model context protocol (MCP) server that allows Claude Desktop or other MCP aware clients to run Bash commands on your local mac… ☆32 · Apr 14, 2025 · Updated last year
- Simple stream multiplexing for objectMode. ☆15 · Mar 28, 2025 · Updated last year
- Amazon Lambda Decider for Simple WorkFlow (SWF) ☆11 · Sep 1, 2015 · Updated 10 years ago
- ☆19 · Sep 21, 2022 · Updated 3 years ago
- ☆15 · Jan 1, 2024 · Updated 2 years ago
- AWS Lambda Scheduler -- Use cron expressions to schedule Aws Lambda Functions ☆12 · Aug 14, 2015 · Updated 10 years ago
- Contains scripts I created for obs-studio (https://github.com/obsproject/obs-studio) ☆29 · Sep 23, 2018 · Updated 7 years ago
- THIS REPO HAS BEEN MOVED TO https://github.com/sockethub/sockethub - a simple tool to facilitate handling and referencing activity stream… ☆11 · Dec 30, 2019 · Updated 6 years ago
- Realtime demo, Streaming and Finetuning code for CSM ☆455 · Sep 17, 2025 · Updated 8 months ago
- API Explorer to browse JSON Schema based APIs ☆39 · Dec 1, 2022 · Updated 3 years ago
- RunJOP (Run Just Once Please) is a distributed execution framework to run a command (i.e. a job) only once in a group of servers. ☆20 · Feb 28, 2014 · Updated 12 years ago
- Llama 3 ORPO Fine Tuning on A100 in Colab Pro. ☆12 · Apr 21, 2024 · Updated 2 years ago
- PDF Query chat-bot using Gemini AI and Llama Index ☆10 · Dec 24, 2023 · Updated 2 years ago
- Stripdown of the mean.io stack for the ngFantasyFootball application ☆111 · Apr 30, 2014 · Updated 12 years ago
- Create JavaScript Error objects with code strings, context details, and uncluttered stacktraces ☆11 · Feb 18, 2026 · Updated 3 months ago
- Collection of Node.js streams. ☆20 · Sep 7, 2017 · Updated 8 years ago
- OBS broadcast tools ☆23 · Aug 27, 2020 · Updated 5 years ago
- Learn & build: Always available expertise powered by AI ☆13 · Jul 10, 2023 · Updated 2 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow ☆20 · Oct 5, 2021 · Updated 4 years ago
- Fork of https://github.com/elastic/supply-chain-monitor with local AI backend (vLLM/llama.cpp) ☆61 · Apr 2, 2026 · Updated last month
- A terraform provider for Kaleido! ☆10 · May 23, 2026 · Updated last week
- Happy wrapper for PDF.JS in Ember! ☆10 · May 7, 2024 · Updated 2 years ago