SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
☆287May 21, 2026Updated this week
Alternatives and similar repositories for sglang-omni
Users that are interested in sglang-omni are comparing it to the libraries listed below.
- MurisPro: professional laboratory mouse management software, for the benefit of everyone who needs to run animal experiments☆25Dec 28, 2025Updated 4 months ago
- ☆27May 12, 2026Updated last week
- ☆113Updated this week
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated last year
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- A regex engine implemented in the traditional way that can generate diagrams of finite automata.☆10May 3, 2018Updated 8 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆18Nov 4, 2025Updated 6 months ago
- ◉ Universal Intelligence: AI made simple.☆56Apr 16, 2026Updated last month
- ☆13Mar 27, 2020Updated 6 years ago
- Lane line (curve) detection using VC☆10Apr 23, 2018Updated 8 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.☆62Nov 24, 2025Updated 5 months ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Nano vLLM☆13Jun 26, 2025Updated 10 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆109May 20, 2025Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- GUI tools for WORLD vocoder☆21Dec 19, 2024Updated last year
- A crawler for high-resolution Google images☆13Jan 7, 2020Updated 6 years ago
- ☆52Mar 9, 2026Updated 2 months ago
- Fast parallel RNN-Transducer.☆10Nov 1, 2019Updated 6 years ago
- ☆26Apr 21, 2021Updated 5 years ago
- A student training project for HLS and transformer☆11Oct 19, 2022Updated 3 years ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆101Mar 14, 2026Updated 2 months ago
- interactive ascii call graph☆14Jan 11, 2026Updated 4 months ago
- A code highlighting plugin built for Hexo that uses Shiki for better highlighting.☆10May 15, 2026Updated last week
- Backtracking regular expression engine written in Python☆13Nov 4, 2022Updated 3 years ago
- Notes on lock-free queues☆11Feb 13, 2017Updated 9 years ago
- Sentiment Analysis Adapter trained on the Yahoo Movie Review dataset by Bandai Namco Research Inc.☆10Oct 21, 2020Updated 5 years ago
- ☆90Jun 16, 2025Updated 11 months ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inferen…☆16Feb 13, 2022Updated 4 years ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆53Updated this week
- ModelQ is a lightweight, battle-tested Python library for scheduling and queuing machine learning inference tasks. It's designed as a fas…☆18May 11, 2026Updated last week
- ☆30Apr 8, 2026Updated last month
- PowerSwitch: an adaptive mode-switching engine for distributed parallel graph computation☆16Dec 23, 2013Updated 12 years ago
- THUIR website☆10Feb 23, 2026Updated 3 months ago
- teb_local_planner source code without ros.☆15Jun 18, 2024Updated last year
- MeloTTS demo on Axera☆12Nov 18, 2025Updated 6 months ago
- Demo of using WASM to sandbox Plotly execution☆20Mar 30, 2025Updated last year
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago