guqiong96 / Lvllm
LvLLM is a specialized extension of vLLM that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU-parallel and NUMA-parallel architecture, supporting hybrid inference for large MoE models.
☆66Updated this week
Alternatives and similar repositories for Lvllm
Users who are interested in Lvllm are comparing it to the libraries listed below.
- zlai☆22Updated last year
- MCP Client as an Agent Strategy Plugin. Supports GUI operation via UI-TARS-SDK.☆158Updated 4 months ago
- Converts files into Markdown to help RAG pipelines or LLMs understand them; based on markitdown and MinerU, which provide high-quality PDF parsing.☆128Updated 7 months ago
- A hybrid-inference extension plugin for vLLM that supports multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill☆20Updated last week
- An MCP server that gives local developers' LLMs efficient internet search and content retrieval, saving your tokens☆120Updated 5 months ago
- dify-connector is a tool to publish Dify apps to various IM platforms.☆99Updated last year
- ☆29Updated last year
- MinerU API server☆80Updated 10 months ago
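- Seamlessly integrates, handles, and schedules Dify and Dify on WeChat, with web-based visual multi-user management and one-click ChatBot launch; simplifies the creation of impressive, responsive ChatBot applications.☆71Updated last year
- A merged version of GraphRAG-Ollama-UI and GraphRAG4OpenWebUI (a Gradio web UI for configuring and generating RAG indexes, and FastAPI for serving the RAG API)☆104Updated last year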
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆81Updated 10 months ago
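- A cross-platform desktop chat client built with LangChain and Electron. Supports local knowledge bases, tool calling, and calling multiple intelligent agents. The goal is an intelligent agent that runs locally and as fully offline as possible.☆54Updated 2 months ago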
- ☆148Updated last year
- A GUI version of GOT-OCR offering OCR, PDF export, batch processing, and other features, but no training functionality☆182Updated 3 weeks ago
- An API and web UI project for F5-TTS☆64Updated 10 months ago
- 🔥 Turn entire websites into LLM-ready markdown☆96Updated last year
- ☆53Updated 10 months ago
- mirror of https://huggingface.co/spaces/enzostvs/deepsite☆76Updated 7 months ago
- Using GPT to parse PDF☆101Updated last year
- By invoking local large language models, this tool processes spreadsheets similar to multi-dimensional tables. It can batch-generate cont…☆38Updated 7 months ago
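- A powerful MCP translation server! #AiryLarkMCP 🌐 Designed for professional translators: • Three-stage translation workflow: analysis and planning, precise translation, full-text review • Automatic recognition of domain-specific terminology • Comprehensive translation quality assessment • Translation between multiple languages • Preserves the style and professionalism of the source text 💯 Seamless integration with Claude, Cursor, and others that support…☆23Updated 7 months ago
- Stagehand-GLM is an AI browser automation framework deeply customized from stagehand-python and specifically adapted to Zhipu AI's GLM text and multimodal large models. It provides a progressive RPA operation strategy that lets developers find the best balance between intelligent convenience and cost-effectiveness.☆23Updated 2 months ago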
- A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.☆169Updated 3 months ago
- ☆107Updated last month
- An open version of Manus.☆67Updated 7 months ago
- Agents of C.L.I.☆139Updated 2 months ago
- A Chinese speech-to-text project that wraps FireRedASR☆80Updated 8 months ago
- ✨🦋 illufly (幻蝶): a self-evolving agent based on memory distillation and document retrieval☆74Updated 5 months ago
- An unofficial mem0ai provider for use with https://github.com/tonori/mem0ai-api.☆47Updated last year
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆164Updated last year