LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆372Jun 8, 2026Updated this week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆83Updated this week
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆33Nov 7, 2025Updated 7 months ago
- ☆18Oct 2, 2025Updated 8 months ago
- A modern low-code visual programming IDE built on NodeGraphQt and qfluentwidgets, supporting drag-and-drop component orchestration, async…☆176Apr 30, 2026Updated last month
- fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tp…☆4,756Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Experimental Realization of Asynchronous Symbiotic Compilation in PyTorch 2.8☆16Apr 25, 2025Updated last year
- ☆107Updated this week
- [2025AIAgent / 2025InternLab]An agent that provides free and flexible access to Search external knowledge.☆23Feb 18, 2026Updated 3 months ago
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆30May 17, 2026Updated 3 weeks ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Feb 5, 2024Updated 2 years ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆22Jun 26, 2025Updated 11 months ago
- ComfyUI custom nodes for LTXV audio-video separation sampling and latent preparation. PainterSamplerLTXV: Advanced sampler with external…☆106Jan 20, 2026Updated 4 months ago
- 一个提示词管理工具,可以配置模型 API 进行调试,记录每次调试的提示词和模型返回,包含一个简单版本管理。☆22Dec 7, 2024Updated last year
- ☆35Jul 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 这是一个基于FastAPI的智能视频识别系统,集成了Ollama大模型,能够实时处理RTSP视频流并提供AI驱动的内容识别功能。系统采用现代化的Web界面设计,支持多终端访问,为视频监控和内容分析提供了强大的解决方案。☆40Jun 17, 2025Updated 11 months ago
- vite-vue2-ts-template-starter☆10Mar 12, 2023Updated 3 years ago
- A source repo of Postgres Chinese full-test search docker image, based on zhparser.☆10Mar 25, 2021Updated 5 years ago
- erniebot兼容openai的API调用方式,支持流式,非流式调用 ,支持system提示词☆20Apr 28, 2025Updated last year
- ☆16Jul 29, 2025Updated 10 months ago
- 一个简单的Godot游戏Demo,目标是实现手机上的RPG游戏,可以多人战斗,回合制,自动战斗☆13Aug 14, 2022Updated 3 years ago
- ComfyUI常用节点插件收藏插件☆19Oct 24, 2025Updated 7 months ago
- ☆17Sep 24, 2016Updated 9 years ago
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆56Jan 22, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The main feature of this plugin is to quickly insert common Markdown code and HTML code, including Sup, Sub, Audio, Video, Iframe, Left-C…☆16May 11, 2024Updated 2 years ago
- NVIDIA Linux open GPU with P2P support☆268Jun 2, 2026Updated last week
- KTransformers 一键部署脚本☆59Apr 18, 2025Updated last year
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆45May 1, 2025Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆255May 9, 2026Updated last month
- ☆23Nov 26, 2025Updated 6 months ago
- This is a plugin for obsidian. The Goal of this plugin is making Obsidian canvas easier to edit. (inspired by heptabase)☆14Sep 29, 2023Updated 2 years ago
- Import Obsidian Vault in TiddlyWiki5☆12Updated this week
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech ge…☆41Dec 11, 2025Updated 5 months ago
- See the favicon for a linked website.☆14Mar 4, 2023Updated 3 years ago
- 整理的vSphere Management SDK,使之能够通过idea编译运行☆14Jul 7, 2018Updated 7 years ago
- C# DDE Client for MetaTrader 4 (via Ndde)☆10Jan 1, 2018Updated 8 years ago
- Media(Video/Audio) Playback Enhancement for Obsidian.md☆10Jul 11, 2023Updated 2 years ago
- 关于Multicharts程序化交易的基础代码(画图,交易,打印输出等)☆10May 21, 2019Updated 7 years ago
- ☆11Dec 9, 2019Updated 6 years ago