fyabc/vllm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fyabc/vllm)

fyabc / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

☆49

Alternatives and similar repositories for vllm

Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hyc2026 / sft-qwen2.5-omni-thinker
View on GitHub
verl: Volcano Engine Reinforcement Learning for LLMs
☆42Jun 23, 2025Updated last year
aringlis / afino_release_version
View on GitHub
☆10Jul 1, 2024Updated 2 years ago
BorealisAI / ssl-for-timeseries
View on GitHub
Self Supervised Learning for Time Series Using Similarity Distillation
☆11Jun 29, 2022Updated 4 years ago
Gaiejj / align-anything
View on GitHub
☆16Nov 11, 2025Updated 8 months ago
fengnian123 / qwen-2.5-omni-realtime-chat
View on GitHub
使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等
☆14Jun 27, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jcwang0602 / VPTracker
View on GitHub
VPTracker: Global Vision-Language Tracking via Visual Prompt and MLLM
☆16Mar 10, 2026Updated 4 months ago
AIFSH / SemiChat-ComfyUI
View on GitHub
☆12Feb 19, 2025Updated last year
noemaresearch / pinboard
View on GitHub
Pin files for contextual, codebase-level AI assistance.
☆16Jul 11, 2024Updated 2 years ago
callbacked / qwen3-mcp
View on GitHub
An MCP-enabled Qwen3 0.6B demo with adjustable thinking budget, all in your browser!
☆28Jun 2, 2025Updated last year
Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
View on GitHub
基于通义千问 Qwen2.5-Omni 的实时语音对话系统，使用在线API服务，支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …
☆91May 11, 2025Updated last year
ShirazAdam / TinyWall
View on GitHub
TinyWall is a free, non-intrusive, secure-by-default firewall for Windows.
☆12Jun 18, 2026Updated last month
DongSky / MR-GDINO
View on GitHub
☆54Dec 23, 2024Updated last year
GraftingRayman / ComfyUI_GraftingRayman
View on GitHub
Nodes for ComfyUI to simply workflows
☆76Jun 29, 2026Updated 3 weeks ago
PanasonicConnect / InvReg
View on GitHub
Invariant Feature Regularization for Fair Face Recognition (ICCV'23)
☆15Oct 23, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
hatsu3 / curator
View on GitHub
☆13Jan 17, 2024Updated 2 years ago
shivamsanju / ragswift
View on GitHub
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Jan 29, 2024Updated 2 years ago
alexcong / ComfyUI_QwenVL
View on GitHub
ComfyUI QwenVL and Qwen wrapper
☆145Nov 29, 2025Updated 7 months ago
dicksondickson / ComfyUI-Clean-Install
View on GitHub
Scripts help setup a clean install of ComfyUI and supporting tools
☆26Aug 19, 2024Updated last year
yeweiyangxinci / SentimentAnalysis_api
View on GitHub
使用django对情感分析功能进行封装，里面包含使用情感词典和Bert模型进行情感分类，最后可以使用tensorFlow serving将模型部署在docker中运行。
☆13Sep 23, 2019Updated 6 years ago
M1n9X / GraphRAG_Lite
View on GitHub
☆16Jul 12, 2024Updated 2 years ago
pritamqu / OOD-VSSL
View on GitHub
[NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
☆13Jan 30, 2024Updated 2 years ago
liumy2010 / UFT
View on GitHub
UFT: Unifying Supervised and Reinforcement Fine-Tuning
☆31Jun 30, 2025Updated last year
yxlu-0102 / IDEA-TTS
View on GitHub
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Mar 21, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
co2-git / reactors
View on GitHub
View components and APIs that work web, mobile and native!
☆14Jan 20, 2018Updated 8 years ago
HenkDz / rgthree-comfy
View on GitHub
Making ComfyUI more comfortable!
☆19Sep 13, 2025Updated 10 months ago
yangyaofei / dify-vllm-provider
View on GitHub
☆30May 12, 2026Updated 2 months ago
liuhuanshuo / notes-python
View on GitHub
中文 Python 笔记
☆12Jan 15, 2018Updated 8 years ago
Amorano / Jovi_GLSL
View on GitHub
ComfyUI Nodes that integrate GLSL shader support.
☆20Aug 25, 2025Updated 11 months ago
domingomery / Xdefects
View on GitHub
Automatic defect recognition in X-ray testing using computer vision
☆13Dec 8, 2018Updated 7 years ago
LordKa-Berlin / ImagePromptViewer
View on GitHub
A Python script to display PNG images (e.g. from StableDiffusion) and extract prompt, negative prompt, and settings from text chunks. Off…
☆21Apr 9, 2025Updated last year
FearL0rd / ComfyUI-Flash-Attention_v100
View on GitHub
A ComfyUI custom node enabling **Flash Attention 1** on legacy NVIDIA GPUs (Tesla V100, T4) that lack Compute Capability 8.0+ required by…
☆15Feb 9, 2026Updated 5 months ago
GentlemanHu / ComfyUI-SunoAI
View on GitHub
ComfyUI Node wrapper of SunoAI API
☆21Dec 17, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lynnliu030 / artifact-eval
View on GitHub
☆13Apr 9, 2025Updated last year
skandavivek / web-qa
View on GitHub
☆11Feb 25, 2024Updated 2 years ago
AmphionTeam / SD-Eval
View on GitHub
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
☆57Jun 25, 2024Updated 2 years ago
gaurav16gupta / constrainedANN
View on GitHub
☆14Jan 20, 2025Updated last year
Athe-kunal / SEC-Summarize-Project
View on GitHub
Summarize SEC documents using LLMs
☆14Aug 23, 2023Updated 2 years ago
Jisencc / yolov7-keypoint-customization
View on GitHub
Revision of official yolov7-pose to support custom dataset for keypoint detection
☆11Nov 12, 2023Updated 2 years ago
LearnWeb3DAO / DAOHacks-Workshop
View on GitHub
LearnWeb3: How to build an on-chain DAO with automatic proposal execution. Workshop done for ETHGlobal DAOHacks
☆10Apr 7, 2022Updated 4 years ago