LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆250Mar 7, 2026Updated this week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below
Sorting:
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆38Updated this week
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆31Nov 7, 2025Updated 4 months ago
- kun-chat is a lightweight AI conversation app based on Ollama/kun-chat 是一款基于 Ollama 的轻量级 AI 对话应用☆10Jul 16, 2025Updated 7 months ago
- A source repo of Postgres Chinese full-test search docker image, based on zhparser.☆10Mar 25, 2021Updated 4 years ago
- python+mt5客户端实现策略回测☆19Jun 9, 2021Updated 4 years ago
- AutoDev Workbench is an AI-native developer platform designed to accelerate, automate, and contextualize modern software development work…☆75Oct 22, 2025Updated 4 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- SPO | Self-Supervised Prompt Optimization☆28Mar 4, 2025Updated last year
- ☆12Oct 31, 2024Updated last year
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆74Jan 14, 2026Updated last month
- A two phase Fourier neural operator based model for predicting pressures and saturations in porous media☆13May 22, 2025Updated 9 months ago
- very fast python backtesting framework based on amibroker backtesting methodology☆40Dec 6, 2017Updated 8 years ago
- ☆35Jul 31, 2024Updated last year
- 基于GPT的自主AI Agent,可以使用联网搜索、本地知识库查询等工具,根据指定的研究目标写报告,例如写市场分析报告、科研报告☆39Mar 23, 2024Updated last year
- 根据okex-api的相关衍生小脚本☆12Sep 2, 2021Updated 4 years ago
- ☆13Jan 31, 2026Updated last month
- 🎈 Easy-to-use video player for Vue 3.x☆12Aug 22, 2023Updated 2 years ago
- ☆14Jun 10, 2025Updated 9 months ago
- 本项目是基于coze-studio项目进行的二次开发,遵循其Apache 2.0 协议许可证。主要修改并使用其工作流部分的代码,作为联通元景万悟智能体平台的工作流模块。☆28Feb 28, 2026Updated last week
- AI小说洗文工具☆17Jul 19, 2025Updated 7 months ago
- ☆12Mar 26, 2020Updated 5 years ago
- ☆12Jun 9, 2018Updated 7 years ago
- 本项目基于RuoYi-Vue框架为xiaozhi-esp32提供Java后端聊天服务器。帮助个人、企业快速部署的xiaozhi-esp32后端服务。☆21Jun 19, 2025Updated 8 months ago
- ☆42Jan 17, 2026Updated last month
- An editor for the tagui☆45May 17, 2023Updated 2 years ago
- openai chatgpt or local llm(llama.cpp gguf format)+TTS+STT+Word+Excel☆101Jul 1, 2024Updated last year
- S-Drama 短剧引擎 ai 全栈☆17Jun 22, 2025Updated 8 months ago
- Subdivider converts your notes into nested folders, automatically creating separate files for each subheading.☆15Aug 14, 2024Updated last year
- 这是一个为大模型提供 A 股数据的的 MCP(Model Content Protocol) 服务。☆20Aug 31, 2025Updated 6 months ago
- ☆10Sep 10, 2024Updated last year
- a python implementation of the strategy as described in "Street Smarts: High Probability Short-Term Trading Strategies" by Linda Raschke☆11Oct 21, 2019Updated 6 years ago
- A sd-webui extension for utilizing DanTagGen to "upsample prompts".☆13Jun 13, 2024Updated last year
- ☆13Jan 16, 2019Updated 7 years ago
- Code for the Nadaraya-Watson Head - an interpretable/explainable, nonparametric classification head which can be used with any neural net…☆12Jul 16, 2024Updated last year
- Option Selling Algorithm built upon the Interactive Brokers Python API☆10Oct 9, 2020Updated 5 years ago
- API server for F5-TTS☆20Jan 24, 2026Updated last month
- A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M.,…☆10Dec 7, 2020Updated 5 years ago
- 一个高性能的ComfyUI视频&图像放大插件,利用CUDA加速,支持多GPU、混合精度和Tensor Core优化。☆22Oct 26, 2025Updated 4 months ago
- 🧮A Domain-Specific Language (DSL) Approach for Triggering Commands. 📎Generating DSL scripts using LLM and user queries to execute offi…☆13Jul 31, 2025Updated 7 months ago