vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆44 · Updated 9 months ago
Alternatives and similar repositories for ktransformers
Users who are interested in ktransformers are comparing it to the libraries listed below.
- Efficient inference of large language models. ☆149 · Updated 4 months ago
- run DeepSeek-R1 GGUFs on KTransformers ☆260 · Updated 11 months ago
- Electronic Parrot (电子鹦鹉) / Toy Language Model ☆258 · Updated last week
- Janus-Series: Unified Multimodal Understanding and Generation Models (forked from deepseek-ai/Janus) ☆17 · Updated last year
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine ☆543 · Updated 4 months ago
- CPU inference for the DeepSeek family of large language models in C++ ☆317 · Updated 4 months ago
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec… ☆225 · Updated 3 weeks ago
- MoonPalace (月宫) is an API debugging tool provided by Moonshot AI (月之暗面). ☆221 · Updated last year
- run chatglm3-6b in BM1684X ☆39 · Updated last year
- Compile & run a single CUDA file on the cloud GPUs ☆14 · Updated last year
- ☆94 · Updated 6 months ago
- ☆135 · Updated 11 months ago
- llama2.c-zh, a small language model that supports Chinese-language scenarios ☆150 · Updated last year
- C++ implementation of Qwen-LM ☆616 · Updated last year
- a huggingface mirror site. ☆326 · Updated last year
- ncnn android robust video matting ☆20 · Updated 3 weeks ago
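- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability. ☆1,413 · Updated last week
- Wanna breeze through some papers? ☆82 · Updated last week
- A lightweight, high-performance Chinese word segmentation project ☆199 · Updated 2 years ago
- Detect CPU features with single-file ☆442 · Updated last month
- Chinese test question bank for large models, community (unofficial) edition ☆95 · Updated 2 years ago
- Zero-barrier deployment of large models on domestic (Chinese) compute hardware; run Qwen, GLM-4.7, Minimax-2.1, DeepSeek-OCR, and other models with one click ☆236 · Updated this week
- One-click deployment script for KTransformers ☆57 · Updated 9 months ago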
- ☆114 · Updated last year
- Transformer framework for edge computing based on C++. ☆130 · Updated last year
- Tiny C++ LLM inference implementation from scratch ☆102 · Updated last week
- xllamacpp - a Python wrapper of llama.cpp ☆72 · Updated last week
- a lightweight LLM model inference framework ☆749 · Updated last year
- Control drones with natural language ☆167 · Updated 2 weeks ago
- Agents of C.L.I. ☆142 · Updated 4 months ago