vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆40Updated 2 weeks ago
Alternatives and similar repositories for ktransformers
Users that are interested in ktransformers are comparing it to the libraries listed below
Sorting:
- run DeepSeek-R1 GGUFs on KTransformers☆227Updated 2 months ago
- Efficient inference of large language models.☆146Updated 5 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 3 months ago
- ☆16Updated 2 months ago
- ☆88Updated 2 months ago
- ai法律团队☆41Updated 4 months ago
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆15Updated 7 months ago
- 支持中文场景的的小语言模型 llama2.c-zh☆147Updated last year
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,113Updated this week
- 一个中文语音转文字项目,封装自FireRedASR☆47Updated 2 months ago
- ☆13Updated 3 years ago
- 电子鹦鹉 / Toy Language Model☆169Updated this week
- ☆131Updated 3 months ago
- 网络包测试工具☆45Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆12Updated 10 months ago
- 用于kokoro TTS的webui界面和兼容openai api☆32Updated 3 months ago
- LM inference server implementation based on *.cpp.☆191Updated this week
- Compile & run a single CUDA file on the cloud GPUs☆14Updated 8 months ago
- 大模型中文测试题库-民间版本☆82Updated last year
- golang生成支持MSVC调用的dll☆13Updated 2 years ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆60Updated last week
- ☆16Updated 9 months ago
- 360zhinao☆289Updated this week
- A pure Markdown documents showcase☆23Updated 10 years ago
- 自动将大模型部署成openai,并且自动切换模型,自动伸缩扩容☆29Updated 2 weeks ago
- 我从动漫中学习到的知识和人生感悟☆17Updated 2 months ago
- run chatglm3-6b in BM1684X☆38Updated last year
- transformer安全相关☆30Updated 6 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆64Updated last week
- ChatGLM-6B-Slim:裁减掉20K图片Token的ChatGLM-6B,完全一样的性能,占用更小的显存。☆126Updated 2 years ago