kvcache-ai / ktransformers
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
☆16,344 Updated this week
Alternatives and similar repositories for ktransformers
Users who are interested in ktransformers are comparing it to the libraries listed below.
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆22,343 Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆12,920 Updated 3 months ago
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p… ☆8,953 Updated this week
- Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model. ☆4,380 Updated this week
- 🚀🚀 Train a 26M-parameter GPT from scratch in just 2h! ☆37,249 Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆26,129 Updated last week
- A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval ☆12,852 Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆7,521 Updated this week
- GLM-4 series: Open Multilingual Multimodal Chat LMs ☆7,040 Updated 6 months ago
- FlashMLA: Efficient Multi-head Latent Attention Kernels ☆11,979 Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python. ☆7,324 Updated 2 months ago
- Integrate the DeepSeek API into popular software ☆35,086 Updated 3 months ago
- ☆4,590 Updated last month
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆50,714 Updated this week
- 🚀 Train a 26M-parameter VLM from scratch in just 1 hour! ☆6,041 Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (… ☆12,112 Updated last week
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a… ☆4,569 Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆3,298 Updated 6 months ago
- Sharing some useful Dify DSL workflows, suitable for both personal use and learning. ☆9,949 Updated 7 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. ☆4,565 Updated this week
- A simple screen parsing tool towards pure vision based GUI agent ☆24,207 Updated 4 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆20,157 Updated last month
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o. ☆9,719 Updated 3 months ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation ☆18,865 Updated this week
- DeepEP: an efficient expert-parallel communication library ☆8,898 Updated 3 weeks ago
- Open-Sora: Democratizing Efficient Video Production for All ☆28,271 Updated 8 months ago
- 🔥 MaxKB is a powerful, easy-to-use open-source platform for building enterprise-grade agents. ☆19,839 Updated this week
- Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment ☆18,972 Updated 6 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆65,942 Updated this week
- No fortress, purely open ground. OpenManus is Coming. ☆53,204 Updated 2 weeks ago