vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆40 Updated 2 months ago
Alternatives and similar repositories for ktransformers
Users who are interested in ktransformers are comparing it to the libraries listed below.
- run DeepSeek-R1 GGUFs on KTransformers ☆242 Updated 4 months ago
- Efficient inference of large language models. ☆149 Updated last month
- ☆17 Updated 4 months ago
- Electronic Parrot / Toy Language Model ☆189 Updated last week
- CPU inference for the DeepSeek family of large language models in C++ ☆308 Updated last month
- KTransformers one-click deployment script ☆48 Updated 2 months ago
- llama2.c-zh: a small language model that supports Chinese-language scenarios ☆147 Updated last year
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus ☆17 Updated 5 months ago
- An AI agent to control drones ☆115 Updated this week
- ☆149 Updated last year
- LM inference server implementation based on *.cpp. ☆236 Updated this week
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability. ☆1,159 Updated this week
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec… ☆160 Updated last week
- Chinese test question bank for large models (community edition) ☆84 Updated 2 years ago
- run chatglm3-6b in BM1684X ☆39 Updated last year
- Repository of Phi3 Chinese post-trained models ☆321 Updated 7 months ago
- A pure Markdown documents showcase ☆29 Updated 10 years ago
- Ollama model Registry mirror / accelerator that lets Ollama pull and download models from ModelScope faster. ☆96 Updated 3 months ago
- a huggingface mirror site. ☆289 Updated last year
- C++ implementation of Qwen-LM ☆596 Updated 7 months ago
- 360zhinao ☆290 Updated 2 months ago
- MoLing is a computer-use and browser-use based MCP server. It is a locally deployed, dependency-free office AI assistant. ☆305 Updated last month
- Baidu QA dataset with 1 million entries ☆47 Updated last year
- ☆110 Updated last year
- xllamacpp - a Python wrapper of llama.cpp ☆45 Updated last week
- ☆101 Updated this week
- ☆133 Updated 5 months ago
- ☆90 Updated last week
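- OrionStar-Yi-34B-Chat is an open-source Chinese-English chat model, fine-tuned by OrionStar from the open-source Yi-34B model on 150,000+ high-quality corpus entries. ☆259 Updated last year
- AI legal team ☆42 Updated 6 months ago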