guqiong96 / LvllmView external linksLinks
LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆169Feb 9, 2026Updated last week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below
Sorting:
- A source repo of Postgres Chinese full-test search docker image, based on zhparser.☆10Mar 25, 2021Updated 4 years ago
- python+mt5客户端实现策略回测☆19Jun 9, 2021Updated 4 years ago
- AutoDev Workbench is an AI-native developer platform designed to accelerate, automate, and contextualize modern software development work…☆75Oct 22, 2025Updated 3 months ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- SPO | Self-Supervised Prompt Optimization☆28Mar 4, 2025Updated 11 months ago
- kun-chat is a lightweight AI conversation app based on Ollama/kun-chat 是一款基于 Ollama 的轻量级 AI 对话应用☆10Jul 16, 2025Updated 7 months ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆10Jan 18, 2025Updated last year
- A Firefox Web Extension to run the DuckDuckGo AI Chat page on your sidebar☆12Aug 7, 2025Updated 6 months ago
- ☆12Aug 4, 2024Updated last year
- 根据okex-api的相关衍生小脚本☆12Sep 2, 2021Updated 4 years ago
- AI小说洗文工具☆17Jul 19, 2025Updated 6 months ago
- siyuan-plugin-picture-library☆13Dec 2, 2025Updated 2 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- ☆12Jun 9, 2018Updated 7 years ago
- 本项目基于RuoYi-Vue框架为xiaozhi-esp32提供Java后端聊天服务器。帮助个人、企业快速部署的xiaozhi-esp32后端服务。☆21Jun 19, 2025Updated 7 months ago
- 使用R语言的jiebaR包的情感分析☆10May 15, 2017Updated 8 years ago
- An editor for the tagui☆44May 17, 2023Updated 2 years ago
- openai chatgpt or local llm(llama.cpp gguf format)+TTS+STT+Word+Excel☆101Jul 1, 2024Updated last year
- ☆12May 16, 2024Updated last year
- ☆10Mar 9, 2019Updated 6 years ago
- API server for F5-TTS☆19Jan 24, 2026Updated 3 weeks ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆29Feb 4, 2026Updated last week
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Jan 9, 2026Updated last month
- 一个高性能的ComfyUI视频&图像放大插件,利用CUDA加速,支持多GPU、混合精度和Tensor Core优化。☆22Oct 26, 2025Updated 3 months ago
- Node.js library for the Tradier API☆11Feb 12, 2022Updated 4 years ago
- 提供了一个极简的发电文案接口和一些云崽插件☆11Jan 17, 2025Updated last year
- MCP server that gives Claude instant, intelligent access to your codebase using Language Server Protocol☆17Jun 27, 2025Updated 7 months ago
- python, ccxt, backtrader, dash☆10Apr 20, 2018Updated 7 years ago
- Subdivider converts your notes into nested folders, automatically creating separate files for each subheading.☆15Aug 14, 2024Updated last year
- ☆11Apr 13, 2017Updated 8 years ago
- S-Drama 短剧引擎 ai 全栈☆16Jun 22, 2025Updated 7 months ago
- Option Selling Algorithm built upon the Interactive Brokers Python API☆10Oct 9, 2020Updated 5 years ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- 🤖 It is basic tools build for scraping & embedding text. The main technologies included OpenAI embeddings, Supabase and Next.js.☆16Apr 12, 2023Updated 2 years ago
- CV approach aimed to remove moving objects in videos (dynamic and static camera)☆11Mar 21, 2021Updated 4 years ago
- ☆13Jan 16, 2019Updated 7 years ago
- a python implementation of the strategy as described in "Street Smarts: High Probability Short-Term Trading Strategies" by Linda Raschke☆11Oct 21, 2019Updated 6 years ago
- Code for the Nadaraya-Watson Head - an interpretable/explainable, nonparametric classification head which can be used with any neural net…☆12Jul 16, 2024Updated last year
- The main feature of this plugin is to quickly insert common Markdown code and HTML code, including Sup, Sub, Audio, Video, Iframe, Left-C…☆16May 11, 2024Updated last year