maaaxinfinity / ktrunLinks
KTransformers 一键部署脚本
☆51Updated 5 months ago
Alternatives and similar repositories for ktrun
Users that are interested in ktrun are comparing it to the libraries listed below
Sorting:
- run DeepSeek-R1 GGUFs on KTransformers☆251Updated 6 months ago
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆247Updated this week
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,278Updated this week
- triton3.2.0添加mi25/mi50/mi60支持☆14Updated 4 months ago
- 一套基于Vllm的显存内存混合模式大模型部署工具(图形界面),VRAMandDRAM模式虽然慢一点,但是解决了超大模型在普通家用计算机上的部署问题。☆85Updated 4 months ago
- ☆35Updated 4 months ago
- ☆166Updated 5 months ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆43Updated 4 months ago
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆159Updated 5 months ago
- Community maintained hardware plugin for vLLM on Ascend☆1,128Updated this week
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆13Updated last week
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆209Updated this week
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆261Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆62Updated 10 months ago
- torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics c…☆432Updated this week
- Run generative AI models in sophgo BM1684X/BM1688☆240Updated last week
- KnowFlowRAG☆287Updated this week
- llamafactory blog☆40Updated 11 months ago
- Ragflow-Plus 是 Ragflow 的二次开发版本,使其更为简洁实用☆1,015Updated 2 weeks ago
- 大模型中文测试题库-民间版本☆88Updated 2 years ago
- ☆265Updated 8 months ago
- KAG开源框架介绍及使用KAG实现知识增强生成应用(产品模式测试、开发者模式测试),KAG是OpenSPG发布v0.5版本中推出的知识增强生成(KAG)的专业领域知识服务框架,旨在充分利用知识图谱和向量检索的优势,增强大型语言模型和知识图谱,以解决 RAG 挑战☆144Updated 5 months ago
- AI虚拟伙伴Linux版☆109Updated last month
- ☆287Updated 7 months ago
- Scripting tool for downloading Dify plugin package from Dify Marketplace and Github and repackaging [true] offline package.☆447Updated 2 weeks ago
- a lightweight LLM model inference framework☆739Updated last year
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆1,690Updated this week
- 标书大模型(Proposal-LLM Chinese version )☆271Updated 10 months ago
- ☆351Updated last year
- RAG SYSTEM FOR RWKV☆51Updated 9 months ago