maaaxinfinity / ktrunLinks
KTransformers 一键部署脚本
☆55Updated 8 months ago
Alternatives and similar repositories for ktrun
Users that are interested in ktrun are comparing it to the libraries listed below
Sorting:
- run DeepSeek-R1 GGUFs on KTransformers☆259Updated 10 months ago
- 一套基于Vllm的显存内存混合模式大模型部署工具(图形界面),VRAMandDRAM模式虽然慢一点,但是解决了超大模型在普通家用计算机上的部署问题。☆91Updated 8 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,374Updated this week
- ☆44Updated 8 months ago
- ☆172Updated 9 months ago
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆12Updated 3 months ago
- LM inference server implementation based on *.cpp.☆295Updated last month
- LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features …☆98Updated this week
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆202Updated 3 weeks ago
- 使用open-webui中的pipelines技术在open-webui中调用ragflow的agent实现基于知识库的智能对话,并拥有美观的界面。☆153Updated 2 months ago
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆262Updated 9 months ago
- Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model.☆4,356Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆44Updated 8 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆243Updated 2 weeks ago
- ☆274Updated last year
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆89Updated last year
- A tool for creating pre-training datasets for language models, supporting one-click batch processing for both text and image datasets. 一个…☆43Updated last year
- Community maintained hardware plugin for vLLM on Ascend☆1,532Updated this week
- ☆349Updated last year
- Phi3 中文后训练模型仓库☆324Updated last year
- Ollama Desktop是基于Ollama引擎的一个桌面应用解决方案,用于在macOS、Windows和Linux操作系统上运行和管理Ollama模型的GUI工具。☆177Updated 5 months ago
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,215Updated this week
- 大模型中文测试题库-民间版本☆92Updated 2 years ago
- OpenKAG (Open Knowledge Augmented Generation), is an enterprise intelligent knowledge platform based on large model technology.☆57Updated 9 months ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆29Updated 2 months ago
- The AI-powered PPT generation service based on ChatPPT can create presentations based on themes, requirements, or uploaded documents, sup…☆120Updated 3 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆223Updated this week
- KnowFlowRAG☆406Updated last week
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆104Updated last year
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆169Updated last year