LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆377Jun 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆32Nov 7, 2025Updated 7 months ago
- ☆18Oct 2, 2025Updated 8 months ago
- comfyui大炮工具箱,集合常用工具,方便日常使用☆68Jun 17, 2026Updated 2 weeks ago
- A modern low-code visual programming IDE built on NodeGraphQt and qfluentwidgets, supporting drag-and-drop component orchestration, async…☆180Apr 30, 2026Updated 2 months ago
- fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tp…☆4,812Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆112Updated this week
- 本项目是基于coze-studio项目进行的二次开发,遵循其Apache 2.0 协议许可证。主要修改并使用其工作流部分的代码,作为联通元景万悟智能体平台的工作流模块。☆36Jun 12, 2026Updated 2 weeks ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆101May 26, 2026Updated last month
- iptables in web browser☆11Mar 13, 2020Updated 6 years ago
- ☆76Apr 1, 2026Updated 2 months ago
- The complete NUMA-optimized branch of the ktransformers project☆25Nov 3, 2025Updated 7 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆22Jun 26, 2025Updated last year
- 这是一个基于FastAPI的智能视频识别系统,集成了Ollama大模型,能够实时处理RTSP视频流并提供AI驱动的内容识别功能。系统采用现代化的Web界面设计,支持多终端访问,为视频监控和内容分析提供了强大的解决方案。☆40Jun 17, 2025Updated last year
- erniebot兼容openai的API调用方式,支持流式,非流式调用 ,支持system提示词☆20Apr 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a VideoAsPrompt ComfyUI plugin☆21Oct 30, 2025Updated 8 months ago
- 自用发票排版打印工具,目前仅支持纵向单页两张发票布局,调用微软edge浏览器来实现打印(后续可能会实现自写打印页 面)☆34Dec 1, 2025Updated 6 months ago
- ☆37Jun 9, 2026Updated 3 weeks ago
- 大模型推理框架加速,让 LLM 飞起来☆24May 10, 2024Updated 2 years ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆254May 9, 2026Updated last month
- ☆23Nov 26, 2025Updated 7 months ago
- This is a plugin for obsidian. The Goal of this plugin is making Obsidian canvas easier to edit. (inspired by heptabase)☆14Sep 29, 2023Updated 2 years ago
- iw4x server for Docker container☆13Apr 18, 2025Updated last year
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech ge…☆42Dec 11, 2025Updated 6 months ago
- See the favicon for a linked website.☆14Mar 4, 2023Updated 3 years ago
- 整理的vSphere Management SDK,使之能够通过idea编译运行☆14Jul 7, 2018Updated 7 years ago
- C# DDE Client for MetaTrader 4 (via Ndde)☆10Jan 1, 2018Updated 8 years ago
- Media(Video/Audio) Playback Enhancement for Obsidian.md☆10Jul 11, 2023Updated 2 years ago
- This is a plugin for obsidian which highlights a block of text or a word as you scroll down while reading.☆12Feb 18, 2026Updated 4 months ago
- 关于Multicharts程序化交易的基础代码(画图,交易,打印输出等)☆10May 21, 2019Updated 7 years ago
- Use sing-box, clash, v2ray, xray tunnel proxy on Android devices.☆16Oct 23, 2024Updated last year
- ComfyUI node for Infinite You for identity preservation. Supports multiple characters, face pose and more☆19Mar 30, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10May 27, 2025Updated last year
- ☆295May 10, 2026Updated last month
- 解决尺寸偏移,以及尺寸限制问题☆23Jan 5, 2026Updated 5 months ago
- A ComfyUI custom node implementation of TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows.☆44Mar 6, 2026Updated 3 months ago
- AutoDev Workbench is an AI-native developer platform designed to accelerate, automate, and contextualize modern software development work…☆75Oct 22, 2025Updated 8 months ago
- Subdivider converts your notes into nested folders, automatically creating separate files for each subheading.☆15Aug 14, 2024Updated last year
- ☆12May 16, 2024Updated 2 years ago