guqiong96 / Lvllm
LvLLM is a special NUMA extension of vLLM that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU-parallel and NUMA-parallel architecture, supporting hybrid inference for large MoE models.
☆94Updated last week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below
- ☆108Updated 3 weeks ago
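- A hybrid-inference extension plugin for vLLM that supports multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill☆29Updated last month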
- ☆94Updated 5 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Updated last year
- An MCP Server that gives local developers' LLMs efficient internet search and content retrieval, saving your tokens☆125Updated 7 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆259Updated 9 months ago
- ☆29Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆68Updated this week
- A Chinese speech-to-text project, built as a wrapper around FireRedASR☆81Updated 10 months ago
- LM inference server implementation based on *.cpp.☆294Updated last month
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more☆360Updated 2 months ago
- Mission intent compiler and autonomy supervisor for unmanned systems.☆144Updated 2 weeks ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆131Updated 9 months ago
- A merged version of GraphRAG-Ollama-UI + GraphRAG4OpenWebUI (with a Gradio web UI for configuring and generating RAG indexes, and FastAPI providing the RAG API service)☆104Updated last year
- MCP Client as an Agent Strategy Plugin. Support GUI operation via UI-TARS-SDK.☆160Updated 5 months ago
- Code for ACL25-findings. An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social g…☆90Updated 2 months ago
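- Stagehand-GLM is an AI browser automation framework deeply customized from stagehand-python and adapted specifically to Zhipu AI's GLM text and multimodal large models. It offers a progressive RPA operation strategy that lets developers find the best balance between intelligent convenience and cost-effectiveness.☆26Updated 4 months ago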
- A GUI version of GOT-OCR that provides OCR, PDF export, batch processing, and other features, but no training functionality☆179Updated last month
- ☆133Updated 8 months ago
- Cross-platform, open-source Chinese NotebookLM app based on Electron, using DeepSeek + Reecho.ai☆82Updated last year
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework.☆29Updated last year
- Library for model distillation☆158Updated 3 months ago
- Using GPT to parse PDF☆102Updated last year
- zlai☆23Updated last year
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆186Updated last week
- A pure Markdown documents showcase☆35Updated 11 years ago
- An open version of Manus.☆68Updated 9 months ago
- OpenAI-compatible APIs for Dify platform services.☆31Updated last year
- ☆149Updated last year
- ☆41Updated 9 months ago