guqiong96 / LvllmLinks
LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features an efficient GPU parallel and NUMA parallel architecture, supporting hybrid inference for MOE large models.
☆110Updated this week
Alternatives and similar repositories for Lvllm
Users that are interested in Lvllm are comparing it to the libraries listed below
Sorting:
- MCP Client as an Agent Strategy Plugin. Support GUI operation via UI-TARS-SDK.☆163Updated 6 months ago
- 用于提供给本地开发者的 LLM的高效互联网搜索&内容获取的MCP Server, 节省你的token☆126Updated 8 months ago
- Convert files into markdown to help RAG or LLM understand, based on markitdown and MinerU, which could provide high quality pdf parser.☆131Updated 9 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆259Updated 10 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more☆371Updated 2 weeks ago
- MinerU API server☆84Updated last year
- ☆107Updated last month
- ☆94Updated 6 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆82Updated last year
- zlai☆23Updated last year
- ☆29Updated last year
- LLM voice chat project by Connect ChatTTS with Local Ollama, 连接本地部署的 Ollama 和 ChatTTS,实现和LLM的语音对话☆65Updated last year
- Integrate Open-AutoGLM's Android & iOS GUI automation into DeepAgents-CLI via LangChain Middleware, combining LLM orchestration with visi…☆81Updated last week
- A tool for creating pre-training datasets for language models, supporting one-click batch processing for both text and image datasets. 一个…☆43Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆70Updated last week
- Agents of C.L.I.☆141Updated 4 months ago
- ☆149Updated last year
- LM inference server implementation based on *.cpp.☆294Updated last month
- 世界上最好的MCP Servers的列表,The best mcp servers in the world.☆108Updated 9 months ago
- ☆40Updated 10 months ago
- mirror of https://huggingface.co/spaces/enzostvs/deepsite☆78Updated 9 months ago
- A DIFY plugin used to render the HTML code output of LLM☆90Updated 8 months ago
- The showcase page of IndexTTS2☆178Updated 4 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆169Updated last year
- Stagehand-GLM 是基于 stagehand-python 深度定制的AI浏览器自动化框架,专门适配了智谱AI的GLM文本和多模态大模型。它提供了渐进式的RPA操作策略,让开发者在智能便捷和成本效益之间找到最佳平衡点。☆27Updated 5 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆105Updated last year
- dify-connector is a tool to publish Dify apps to various IM platforms. | dify-connector 是一个将 Dify 发布到各种 IM 平台的工具。☆101Updated last year
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆90Updated last year
- A open version Manus.☆66Updated 9 months ago
- mcp的webui界面,支持客户端连接多个sse服务端,支持 openai、deepseek、qwen等大模型,另外附上构建的 agent的 stdio和sse的简单 天气查询的完整示例☆39Updated 7 months ago