kvcache-ai / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆13,971Updated this week
Alternatives and similar repositories for ktransformers
Users that are interested in ktransformers are comparing it to the libraries listed below
Sorting:
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆20,762Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆14,188Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆7,802Updated this week
- Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥☆38,553Updated this week
- 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!☆20,678Updated 2 weeks ago
- 🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation☆16,265Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆10,262Updated last week
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆6,542Updated 3 weeks ago
- No fortress, purely open ground. OpenManus is Coming.☆45,370Updated last week
- FlashMLA: Efficient MLA decoding kernels☆11,527Updated 2 weeks ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆48,565Updated this week
- Integrate the DeepSeek API into popular softwares☆32,269Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆8,313Updated this week
- A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.☆13,298Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆33,273Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) an…☆7,450Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆5,894Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46,848Updated this week
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation☆7,755Updated last month
- Fully open reproduction of DeepSeek-R1☆24,340Updated this week
- 🍒 Cherry Studio is a desktop client that supports for multiple LLM providers.☆26,122Updated this week
- A simple screen parsing tool towards pure vision based GUI agent