maaaxinfinity / ktrunLinks
KTransformers 一键部署脚本
☆45Updated last month
Alternatives and similar repositories for ktrun
Users that are interested in ktrun are comparing it to the libraries listed below
Sorting:
- triton3.2.0添加mi25/mi50/mi60支持☆11Updated last month
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆63Updated last week
- 一套基于Vllm的显存内存混合模式大模型部署工具(图形界面),VRAMandDRAM模式虽然慢一点,但是解决了超大模型在普通家用计算机上的部署问题。☆64Updated last month
- run DeepSeek-R1 GGUFs on KTransformers☆234Updated 3 months ago
- ☆146Updated 2 months ago
- ☆23Updated last month
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆110Updated 9 months ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,134Updated this week
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆251Updated 2 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆37Updated 9 months ago
- ☆248Updated 5 months ago
- ☆349Updated 10 months ago
- Build & Optimize your RAG.☆685Updated 3 weeks ago
- Ragflow-Plus 是 Ragflow 的二次开发版本,使其更为简洁实用☆530Updated this week
- Phi3 中文后训练模型仓库☆321Updated 6 months ago
- VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objec…☆149Updated 7 months ago
- ☆232Updated 3 months ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆64Updated 9 months ago
- LM inference server implementation based on *.cpp.☆206Updated this week
- 大模型中文测试题库-民间版本☆84Updated 2 years ago
- MinerU是一款开源的高质量PDF解析工具,基于深度学习技术,可自动提取PDF文档中的文字、表格、图片、公式等内容,并提供丰富的分析、统计、搜索等功能。 本项目为其提供一个简化版本的WebUI,方便用户上传PDF文件,并实时展示提取结果。☆201Updated 6 months ago
- The AI-powered PPT generation service based on ChatPPT can create presentations based on themes, requirements, or uploaded documents, sup…☆52Updated last week
- Chat2Graph: Graph Native Agentic System.☆216Updated this week
- GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combin…☆514Updated 4 months ago
- gpt_server 是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆184Updated last week
- Unsloth框架在Windows平台微调训练Qwen2大模型,非WSL☆60Updated 11 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆191Updated this week
- 为AI带路党Pro视频准备☆247Updated 3 months ago
- 文本语料转训练集工具,txt转dataset☆92Updated last year
- 为酒馆用户搭建LightRAG并连接酒馆,以实现更先进更好用的RAG+酒馆(目前与另一个仓库同步开发中)☆37Updated 5 months ago