LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆283Mar 20, 2026Updated last week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆46Mar 12, 2026Updated 2 weeks ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆31Nov 7, 2025Updated 4 months ago
- kun-chat is a lightweight AI conversation app based on Ollama/kun-chat 是一款基于 Ollama 的轻量级 AI 对话应用☆10Jul 16, 2025Updated 8 months ago
- comfyui大炮工具箱,集合常用工具,方便日常使用☆38Feb 27, 2026Updated last month
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆81Jan 14, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Experimental Realization of Asynchronous Symbiotic Compilation in PyTorch 2.8☆16Apr 25, 2025Updated 11 months ago
- fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tp…☆4,176Mar 19, 2026Updated last week
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated 2 months ago
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- The complete NUMA-optimized branch of the ktransformers project☆25Nov 3, 2025Updated 4 months ago
- ☆19Jul 4, 2025Updated 8 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆19Jun 26, 2025Updated 9 months ago
- 基于Nginx+Lua实现的页面安全认证☆12Nov 12, 2020Updated 5 years ago
- chatGPT网页版,支持服务器部署、公网访问、自定义接口☆11Mar 20, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 这是一个基于FastAPI的智能视频识别系统,集成了Ollama大模型,能够实时处理RTSP视频流并提供AI驱动的内容识别功能。系统采用现代化的Web界面设计,支持多终端访问,为视频监控和内容分析提供了强大的解决方案。☆38Jun 17, 2025Updated 9 months ago
- The project purpose is to develop a comprehensive, robust open source PLM (Product LifeCycle Management) solution.☆20Jul 24, 2023Updated 2 years ago
- 一个提示词管理工具,可以配置模型 API 进行调试,记录每次调试的提示词和模型返回,包含一个简单版本管理。☆22Dec 7, 2024Updated last year
- 协议版sora批量注册机,支持多线程,免费体验PLUS完善中.技术交流QQ群382446☆56Feb 22, 2026Updated last month
- A source repo of Postgres Chinese full-test search docker image, based on zhparser.☆10Mar 25, 2021Updated 5 years ago
- 双图滑动验证码识别工具,支持 docker 部署,HTTP API 访问,支持鼠标轨迹生成。☆18Sep 24, 2023Updated 2 years ago
- ☆17Sep 24, 2016Updated 9 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated last month
- A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech ge…☆36Dec 11, 2025Updated 3 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆51Jan 22, 2026Updated 2 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆45May 1, 2025Updated 10 months ago
- ☆24Dec 31, 2025Updated 2 months ago
- This is a plugin for obsidian. The Goal of this plugin is making Obsidian canvas easier to edit. (inspired by heptabase)☆14Sep 29, 2023Updated 2 years ago
- 整理的vSphere Management SDK,使之能够通过idea编译运行☆14Jul 7, 2018Updated 7 years ago
- 一个为即梦AI打造的MCP服务器,让Claude、Cherry Studio等AI应用直 接调用即梦的AI生成能力。基于jimeng-free-api-all开源项目,提供OpenAI兼容接口。 核心功能:文本生成图像(即梦4.0/3.1)、图像合成(多图融合)、文本生…☆41Updated this week
- ☆11Dec 9, 2019Updated 6 years ago
- ComfyUI node for Infinite You for identity preservation. Supports multiple characters, face pose and more☆19Mar 30, 2025Updated 11 months ago
- ☆10May 27, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- 基于springmvc+mybatis的日报、周报、月报和项目、人员管理系统☆15Apr 3, 2018Updated 7 years ago
- 大语言模型工具集☆25Aug 1, 2025Updated 7 months ago
- AutoDev Workbench is an AI-native developer platform designed to accelerate, automate, and contextualize modern software development work…☆77Oct 22, 2025Updated 5 months ago
- 解决尺寸偏移,以及尺寸限制问题☆22Jan 5, 2026Updated 2 months ago
- incredible AI-Prompts Sharing Web Application - modern full-stack Next.js 13 application powered by MongoDB 🤖☆18May 13, 2024Updated last year
- ☆12May 16, 2024Updated last year