maaaxinfinity / ktrun
One-click deployment script for KTransformers
☆54 Updated 7 months ago
Alternatives and similar repositories for ktrun
Users that are interested in ktrun are comparing it to the libraries listed below
- run DeepSeek-R1 GGUFs on KTransformers ☆255 Updated 8 months ago
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60 ☆327 Updated last month
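- A vLLM-based deployment tool (with GUI) for large models that runs in a hybrid VRAM-and-DRAM mode. The hybrid mode is somewhat slower, but it solves the problem of deploying very large models on ordinary home computers. ☆88 Updated 6 months ago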
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability. ☆1,331 Updated this week
- ☆42 Updated 6 months ago
- Adds MI25/MI50/MI60 support to Triton 3.2.0 ☆14 Updated 6 months ago
- LM inference server implementation based on *.cpp. ☆290 Updated 3 months ago
- ☆171 Updated 7 months ago
- AI virtual companion, Linux version ☆118 Updated 3 months ago
- LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features … ☆76 Updated 2 weeks ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ☆216 Updated 3 months ago
- LLM concurrency performance testing tool with automated stress testing and performance report generation. ☆184 Updated 7 months ago
- Adds a 🚀 streaming web server to GraphRAG: compatible with the OpenAI SDK, supports accessible entity links 🔗 and suggested questions, works with local embedding models, and fixes many issues. ☆260 Updated 7 months ago
- OpenKAG (Open Knowledge Augmented Generation) is an enterprise intelligent knowledge platform based on large model technology. ☆55 Updated 7 months ago
- Pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models; x86 / ARM ☆12 Updated 2 months ago
- Uses the pipelines feature of open-webui to call ragflow agents from within open-webui, enabling knowledge-base-driven intelligent conversation with a polished interface. ☆139 Updated 3 weeks ago
- Merged version of GraphRAG-Ollama-UI + GraphRAG4OpenWebUI (a gradio web UI for configuring and generating the RAG index, and fastapi for serving the RAG API) ☆104 Updated last year
- Community maintained hardware plugin for vLLM on Ascend ☆1,357 Updated this week
- The AI-powered PPT generation service based on ChatPPT can create presentations based on themes, requirements, or uploaded documents, sup… ☆115 Updated 2 months ago
- Implements harmful/harmless refusal removal using pure HF Transformers ☆1,275 Updated last year
- ☆347 Updated last year
- Ollama Desktop is a desktop application built on the Ollama engine: a GUI tool for running and managing Ollama models on macOS, Windows, and Linux. ☆173 Updated 4 months ago
- KnowFlowRAG ☆365 Updated this week
- Ragflow-Plus is a derivative of Ragflow, reworked to be simpler and more practical ☆1,108 Updated 2 months ago
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler ☆32 Updated 3 weeks ago
- Collection of Dify plugin source code ☆315 Updated 5 months ago
- An ultra-low-latency voice interaction system based on GPT-SoVITS speech synthesis ☆164 Updated last week
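- MinerU is an open-source, high-quality PDF parsing tool that uses deep learning to automatically extract text, tables, images, formulas, and other content from PDF documents, and offers analysis, statistics, and search features. This project provides a simplified WebUI for it, so users can upload PDF files and see the extraction results in real time. ☆282 Updated 11 months ago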
- ktransformers v0.3 docker build and run ☆13 Updated 8 months ago
- ☆273 Updated 10 months ago