LvLLM is a NUMA extension of vLLM that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU-parallel and NUMA-parallel architecture, supporting hybrid inference for large MoE models.
☆322Apr 7, 2026Updated last week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below.
- ☆18Oct 2, 2025Updated 6 months ago
- ☆11Apr 8, 2022Updated 4 years ago
- ComfyUI Dapao (大炮) toolbox, a collection of commonly used tools for convenient everyday use☆46Apr 2, 2026Updated 2 weeks ago
- Experimental Realization of Asynchronous Symbiotic Compilation in PyTorch 2.8☆16Apr 25, 2025Updated 11 months ago
- ☆85Updated this week
- This repository accompanies our Cell Metabolism manuscript "Plasma protein-based organ-specific aging and mortality models unveil disease…☆11Jan 5, 2026Updated 3 months ago
- Agently Stage - Efficient Convenient Asynchronous & Multithreaded Programming☆13Apr 2, 2025Updated last year
- ☆12Mar 20, 2024Updated 2 years ago
- ComfyUI-PosterCraft is now available in ComfyUI. PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆20Jun 26, 2025Updated 9 months ago
- ComfyUI custom nodes for LTXV audio-video separation sampling and latent preparation. PainterSamplerLTXV: Advanced sampler with external…☆103Jan 20, 2026Updated 2 months ago
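- Build a Linux ISO image from scratch☆23Dec 24, 2021Updated 4 years ago
- A GUI app for Windows that hides the mouse pointer.☆14Mar 26, 2024Updated 2 years ago
- An AI chat conversion and forwarding platform with a management interface, modeled on ONE-API; its initial business logic and implementation follow ONE-API.☆25Sep 15, 2024Updated last year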
- Python bindings for ros2_control with pybind11☆12Updated this week
- This is a short project using www.postgis.net and www.github.com/pramsey/pointcloud to efficiently store large point clouds in a www.…☆14Apr 14, 2015Updated 11 years ago
- Learn Diffusion Models☆19Sep 1, 2025Updated 7 months ago
- Source repo for a Postgres Chinese full-text search Docker image, based on zhparser.☆10Mar 25, 2021Updated 5 years ago
- An OpenAI-compatible API for erniebot, supporting streaming and non-streaming calls as well as system prompts☆20Apr 28, 2025Updated 11 months ago
- ☆16Jul 29, 2025Updated 8 months ago
- A simple Godot game demo aiming at a mobile RPG with multiplayer, turn-based, and automatic combat☆13Aug 14, 2022Updated 3 years ago
- Diagnostics framework for micro-ROS☆10Jun 4, 2025Updated 10 months ago
- ☆17Sep 24, 2016Updated 9 years ago
- A plugin for bookmarking frequently used ComfyUI nodes and plugins☆19Oct 24, 2025Updated 5 months ago
- A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech ge…☆37Dec 11, 2025Updated 4 months ago
- LTX2 infinite length video generation Comfyui workflow based on the Stable-Video-Infinity concept and workflow☆53Jan 22, 2026Updated 2 months ago
- A simple example of running an mnn-llm language model locally on Android☆13Oct 2, 2025Updated 6 months ago
- The main feature of this plugin is to quickly insert common Markdown code and HTML code, including Sup, Sub, Audio, Video, Iframe, Left-C…☆16May 11, 2024Updated last year
- ☆23Mar 13, 2025Updated last year
- gpt_server is an open-source framework for production-grade deployment of LLMs, embedding, reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.☆250Mar 22, 2026Updated 3 weeks ago
- ☆22Nov 26, 2025Updated 4 months ago
- ☆23Oct 23, 2024Updated last year
- This is a plugin for Obsidian. Its goal is to make Obsidian canvas easier to edit (inspired by Heptabase).☆14Sep 29, 2023Updated 2 years ago
- The PyGpPhs package is designed for applying Gaussian processes to port-Hamiltonian systems.☆11May 6, 2024Updated last year
- Hacker News☆14Updated this week
- See the favicon for a linked website.☆14Mar 4, 2023Updated 3 years ago
- IPython notebooks with sample experiments and ideas☆25Jan 27, 2026Updated 2 months ago
- A Python framework based on David Harel's statecharts (SCXML).☆16Dec 13, 2025Updated 4 months ago
- Media(Video/Audio) Playback Enhancement for Obsidian.md☆10Jul 11, 2023Updated 2 years ago
- This is a plugin for Obsidian that highlights a block of text or a word as you scroll down while reading.☆11Feb 18, 2026Updated 2 months ago