txg1550759 / ktransformers-v0.3-dockerLinks
ktransformers v0.3 docker build and run
☆13Updated 8 months ago
Alternatives and similar repositories for ktransformers-v0.3-docker
Users that are interested in ktransformers-v0.3-docker are comparing it to the libraries listed below
Sorting:
- run DeepSeek-R1 GGUFs on KTransformers☆255Updated 8 months ago
- LM inference server implementation based on *.cpp.☆290Updated 3 months ago
- KTransformers 一键部署脚本☆54Updated 7 months ago
- ☆347Updated last year
- AI Proxy is a high-performance AI gateway using OpenAI's and Claude protocol as the entry point. It features intelligent error handling, …☆238Updated this week
- ☆42Updated 6 months ago
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆234Updated this week
- ☆181Updated this week
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆327Updated last month
- Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.☆24Updated 5 months ago
- Unsloth框架在Windows平台微调训练Qwen2大模型,非WSL☆61Updated last year
- Phi3 中文后训练模型仓库☆324Updated 11 months ago
- LLM 并发性能测试工具,支持自动化压力测试和性能报告生成。☆184Updated 7 months ago
- 大模型中文测试题库-民间版本☆89Updated 2 years ago
- ☆149Updated this week
- ☆273Updated 10 months ago
- ☆171Updated 7 months ago
- One command to run ChatTTS☆61Updated last year
- dify's rag patch module☆277Updated 2 months ago
- A code executor for Dify that is compatible with the official sandbox API calls and dependency installation.☆357Updated 6 months ago
- 🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用…☆548Updated 3 weeks ago
- Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。☆295Updated last year
- 支持OpenAI标准响应格式,可部署为服务并连接任意 支持该格式的前端服务☆38Updated 10 months ago
- DIFY PULGIN 插件源码集合☆315Updated 5 months ago
- MinerU API server☆81Updated 11 months ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆65Updated 6 months ago
- 中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)☆610Updated last year
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆260Updated 7 months ago
- 一套基于Vllm的显存内存混合模式大模型部署工具(图形界面),VRAMandDRAM模式虽然慢一点,但是解决了超大模型在普通家用计算机上的部署问题。☆88Updated 6 months ago
- GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)☆104Updated last year