vLLM patcher for Qwen3.6 on consumer NVIDIA — Qwen3.6-35B-A3B-FP8 (192 tok/s, +68% over stock) + Qwen3.6-27B-int4-AutoRound + 256K context. 126 patches: TurboQuant k8v4 KV, MTP/DFlash spec-decode, FULL cudagraph, hybrid GDN streaming, structured boot summary, one-command installer, 1958 tests. v7.72.2.
☆106May 12, 2026Updated 3 weeks ago
Alternatives and similar repositories for genesis-vllm-patches
Users that are interested in genesis-vllm-patches are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆104Feb 3, 2026Updated 4 months ago
- 饥荒DST-server搭建☆14Nov 6, 2023Updated 2 years ago
- Tool to format gherkin-ast model to gherkin string☆12May 9, 2026Updated last month
- ☆11Jan 5, 2018Updated 8 years ago
- Async flow control for directed-acyclic-graph iteration.☆16Nov 12, 2012Updated 13 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Jun 1, 2026Updated last week
- ☆13Apr 26, 2026Updated last month
- An OpenID Connect ID claims set implementation☆16May 26, 2016Updated 10 years ago
- Supplemental utilities to help with processing data replicated into Hadoop☆14Apr 16, 2018Updated 8 years ago
- 饥荒开服工具(感谢铅笔的前期辛勤付出~)☆10Mar 19, 2019Updated 7 years ago
- Sample app demonstrating Aurelia Material Design Lite binding☆15Jul 20, 2015Updated 10 years ago
- Aurelia bind table integration for RethinkDB via Socket.io☆16Apr 1, 2017Updated 9 years ago
- Run AI coding agents (OpenCode, Claude Code, Gemini CLI) on Android☆143Jan 7, 2026Updated 5 months ago
- A highly optimised, feature rich zsh config with almost under ~20ms load times.☆146Mar 21, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- FSPEC: The Spec-Driven, Multi-Agent Coding Factory. It is infrastructure for the "Dark Factory"—the emerging model of fully autonomous so…☆72Updated this week
- ☆69Feb 27, 2026Updated 3 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- Use WeChat to route messages to Claude Code or Codex.☆85Mar 25, 2026Updated 2 months ago
- Buffered, ack based message queue using socket.io☆20Feb 6, 2019Updated 7 years ago
- Qwen3.6-35B-A3B-heretic NVFP4 + DFlash speculative decoding on DGX Spark (GB10/sm_121a). Source-built vLLM image + 7 patches + comprehens…☆83May 1, 2026Updated last month
- This is a plugin for Premiere Pro, which provídes an automated way to update timecodes / start times of media (clips) in your projects.☆11May 6, 2026Updated last month
- Swing image viewer component☆15Jul 30, 2012Updated 13 years ago
- LingTai AI☆202Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Guided meditation assistant, using scheduled messages with LLaMA☆10Nov 28, 2024Updated last year
- Advanced MT5 indicators and expert advisors experiments☆22Jan 20, 2017Updated 9 years ago
- Spout plugin for Unreal Engine 5 using DirectX12☆34May 23, 2026Updated 2 weeks ago
- Evepraisal helper functions intended for use in Google sheets☆13Oct 12, 2018Updated 7 years ago
- ☆14Mar 30, 2026Updated 2 months ago
- aurelia plugin to observe DOM-element resize events☆16Feb 5, 2019Updated 7 years ago
- ☆15Jul 19, 2024Updated last year
- This repository provides sample apps demonstrating Krisp SDK functionality.☆21Feb 20, 2026Updated 3 months ago
- EveSerenityAccountManager☆17Nov 11, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- All you need to get started with Substance editor development.☆28Mar 6, 2017Updated 9 years ago
- Infinite Scrolling for Aurelia☆14May 19, 2018Updated 8 years ago
- Serverless demo using Code Build and CodePipeline for CI/CD☆22Apr 19, 2023Updated 3 years ago
- AOP Samples - different ways of writing Aspects☆14Jul 30, 2022Updated 3 years ago
- FLUX-REALISM is an experimental, advanced image generation application designed to provide highly realistic image synthesis workflows. Po…☆18May 22, 2026Updated 2 weeks ago
- A client library to query WebFinger records☆55Jun 1, 2026Updated last week
- NVIDIA Nemo Parakeet TDT 0.6B V2 Audio to Text Python Script☆20May 8, 2025Updated last year