A high-throughput and memory-efficient inference and serving engine for LLMs
☆47Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆17Sep 5, 2024Updated last year
- An extension to use Kokoro TTS in text generation webui☆22May 5, 2025Updated 10 months ago
- a ComfyUI custom node for MultiTalk☆32Jun 18, 2025Updated 8 months ago
- ComfyUI QwenVL and Qwen wrapper☆136Nov 29, 2025Updated 3 months ago
- Revision of official yolov7-pose to support custom dataset for keypoint detection☆11Nov 12, 2023Updated 2 years ago
- A simple stable-audio-open-1.0 node for ComfyUI.☆34Aug 10, 2024Updated last year
- Read image segmentation masks fast☆13Jul 25, 2024Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆81May 11, 2025Updated 9 months ago
- Text Corpus of African American Fiction and Poetry, from 1853-1923☆10Aug 5, 2020Updated 5 years ago
- *network spirits inviting chaos*☆16Updated this week
- A count down clock to embed in reveal.js presentations.☆11Jan 6, 2023Updated 3 years ago
- A meal planner and recipe management web application. Allowing the user to efficiently plan their weekly meals.☆12Feb 7, 2022Updated 4 years ago
- HealthiVert-GAN, a novel deep-learning framework designed to generate pseudo-healthy vertebral images. These images simulate the pre-frac…☆11Nov 3, 2025Updated 4 months ago
- Assembly language (汇编语言程序设计 第三版 王爽)☆12Aug 17, 2022Updated 3 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- ☆35Nov 14, 2024Updated last year
- ntegrate Topaz Photo AI's powerful image enhancement capabilities directly into your ComfyUI workflows.☆17May 24, 2025Updated 9 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆22Mar 2, 2026Updated last week
- Generate music videos starring yourself.☆11Apr 3, 2025Updated 11 months ago
- ☆15Jan 24, 2026Updated last month
- A collection of actions for working with ROS data☆14Jun 11, 2025Updated 8 months ago
- ☆17Jul 15, 2025Updated 7 months ago
- About Nam Ha Minh's GitHub☆16Feb 20, 2023Updated 3 years ago
- Rust MCP framework for building AI agents☆22Dec 28, 2025Updated 2 months ago
- An Efficent BPE Algorithm Faster then Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- A virtual musical instrument built using Google MediaPipe.☆12Oct 10, 2022Updated 3 years ago
- A feature-rich Websocket IRC client in JavaScript☆12Dec 13, 2025Updated 2 months ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- PACT Agent Collaboration Layer☆15Feb 26, 2026Updated last week
- springboot环境下java调用c程序生成动态链接库(.so文件),并调用(基于JNI,Ubuntu)☆11Aug 26, 2024Updated last year
- An AI-powered web application leveraging Next.js 14 and TensorFlow.js for real-time object detection. Utilizing Tensorflow model for accu…☆12Dec 3, 2024Updated last year
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- Cholidean Harmony Structure☆31Mar 3, 2026Updated last week
- An auto clicker app that can discreetly keep your windows active through user interactions without interrupting your workflow, even when …☆13Dec 28, 2025Updated 2 months ago
- GazePlotter is a Svelte application for visualizing eye-tracking data. It automatically transforms eye gaze data to interactive scarf plo…☆14Updated this week
- Dockerfile for johnsmith0031/alpaca_lora_4bit☆12Apr 10, 2023Updated 2 years ago
- Run spleeter as a pulseaudio plugin in realtime☆12Mar 24, 2023Updated 2 years ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆42Mar 13, 2023Updated 2 years ago
- docker build nessus with unlimited ip☆13Aug 23, 2021Updated 4 years ago