maaaxinfinity / ktrunLinks
KTransformers 一键部署脚本
☆50Updated 4 months ago
Alternatives and similar repositories for ktrun
Users that are interested in ktrun are comparing it to the libraries listed below
Sorting:
- run DeepSeek-R1 GGUFs on KTransformers☆250Updated 5 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,245Updated this week
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆194Updated last week
- ☆164Updated 5 months ago
- ☆33Updated 3 months ago
- 一套基于Vllm的显存内存混合模式大模型部署工具(图形界面),VRAMandDRAM模式虽然慢一点,但是解决了超大模型在普通家用计算机上的部署问题。☆82Updated 4 months ago
- LM inference server implementation based on *.cpp.☆271Updated 2 weeks ago
- 使用open-webui中的pipelines技术在open-webui中调用ragflow的agent实现基于知识库的智能对话,并拥有美观的界面。☆121Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆56Updated 10 months ago
- Simple, scalable AI model deployment on GPU clusters☆3,553Updated this week
- Ragflow-Plus 是 Ragflow 的二次开发版本,使其更为简洁实用☆906Updated 2 weeks ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆257Updated 5 months ago
- Community maintained hardware plugin for vLLM on Ascend☆1,048Updated this week
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆107Updated last year
- 微软开源多Agent智能体协作框架AutoGen全新改版核心概念介绍及相关案例测试☆46Updated 8 months ago
- MinerU是一款开源的高质量PDF解析工具,基于深度学习技术,可自动提取PDF文档中的文字、表格、图片、公式等内容,并提供丰富的分析、统计、搜索等功能。 本项目为其提供一个简化版本的WebUI,方便用户上传PDF文件,并实时展示提取结果。☆240Updated 8 months ago
- 大模型中文测试题库-民间版本☆90Updated 2 years ago
- KAG开源框架介绍及使用KAG实现知识增强生成应用(产品模式测试、开发者模式测试),KAG是OpenSPG发布v0.5版本中推出的知识增强生成(KAG)的专业领域知识服务框架,旨在充分利用知识图谱和向量检索的优势,增强大型语言模型和知识图谱,以解决 RAG 挑战☆138Updated 4 months ago
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆151Updated 5 months ago
- The AI-powered PPT generation service based on ChatPPT can create presentations based on themes, requirements, or uploaded documents, sup…☆87Updated this week
- KnowFlowRAG☆257Updated this week
- Less Code, Lower Barrier, Faster Deployment☆747Updated this week
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆160Updated 9 months ago
- Ollama 中文文档☆46Updated last year
- DIFY PULGIN 插件源码集合☆287Updated 2 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆206Updated this week
- torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics c…☆434Updated 2 weeks ago
- 自动批量上传并解析文档至 RagFlow 知识库,省去手动操作,提升效率。☆422Updated 3 weeks ago
- ☆263Updated 8 months ago
- ☆379Updated 3 months ago