LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆362Apr 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆79Apr 22, 2026Updated 3 weeks ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆32Nov 7, 2025Updated 6 months ago
- ☆18Oct 2, 2025Updated 7 months ago
- comfyui大炮工具箱,集合常用工具,方便日常使用☆56Apr 2, 2026Updated last month
- ☆103May 10, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [2025AIAgent / 2025InternLab]An agent that provides free and flexible access to Search external knowledge.☆23Feb 18, 2026Updated 3 months ago
- Agently Stage - Efficient Convenient Asynchronous & Multithreaded Programming☆13Apr 2, 2025Updated last year
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆27Updated this week
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Feb 5, 2024Updated 2 years ago
- The complete NUMA-optimized branch of the ktransformers project☆25Nov 3, 2025Updated 6 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆21Jun 26, 2025Updated 10 months ago
- 基于Nginx+Lua实现的页面安全认证☆12Nov 12, 2020Updated 5 years ago
- ComfyUI custom nodes for LTXV audio-video separation sampling and latent preparation. PainterSamplerLTXV: Advanced sampler with external…☆104Jan 20, 2026Updated 4 months ago
- chatGPT网页版,支持服务器部署、公网访问、自定义接口☆11Mar 20, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python的音频工具☆16Dec 5, 2025Updated 5 months ago
- 一个提示词管理工具,可以配置模型 API 进行调试,记录每次调试的提示词和模型返回,包含一个简单版本管理。☆22Dec 7, 2024Updated last year
- 这是一个基于FastAPI的智能视频识别系统,集成了Ollama大模型,能够实时处理RTSP视频流并提供AI驱动的内容识别功能。系统采用现代化的Web界面设计,支持多终端访问,为视频监控和内容分析提供了强大的解决方案。☆40Jun 17, 2025Updated 11 months ago
- vite-vue2-ts-template-starter☆10Mar 12, 2023Updated 3 years ago
- A source repo of Postgres Chinese full-test search docker image, based on zhparser.☆10Mar 25, 2021Updated 5 years ago
- erniebot兼容openai的API调用方式,支持流式,非流式调用 ,支持system提示词☆20Apr 28, 2025Updated last year
- ☆16Jul 29, 2025Updated 9 months ago
- 将北航课表导入到各个平台的系统日历中☆10Mar 5, 2020Updated 6 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆55Jan 22, 2026Updated 3 months ago
- Android本地运行mnn-llm语言模型简单示例☆13Oct 2, 2025Updated 7 months ago
- The main feature of this plugin is to quickly insert common Markdown code and HTML code, including Sup, Sub, Audio, Video, Iframe, Left-C…☆16May 11, 2024Updated 2 years ago
- ☆24Dec 31, 2025Updated 4 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆45May 1, 2025Updated last year
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆253May 9, 2026Updated last week
- ☆23Nov 26, 2025Updated 5 months ago
- Hacker News☆15Updated this week
- iw4x server for Docker container☆13Apr 18, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Import Obsidian Vault in TiddlyWiki5☆12May 14, 2026Updated last week
- A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech ge…☆41Dec 11, 2025Updated 5 months ago
- See the favicon for a linked website.☆14Mar 4, 2023Updated 3 years ago
- Media(Video/Audio) Playback Enhancement for Obsidian.md☆10Jul 11, 2023Updated 2 years ago
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- This is a plugin for obsidian which highlights a block of text or a word as you scroll down while reading.☆11Feb 18, 2026Updated 3 months ago
- ☆270May 10, 2026Updated last week