vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆40Updated last month
Alternatives and similar repositories for ktransformers
Users that are interested in ktransformers are comparing it to the libraries listed below
- run DeepSeek-R1 GGUFs on KTransformers☆236Updated 3 months ago
- ☆17Updated 3 months ago
- Efficient inference of large language models.☆148Updated last week
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 5 months ago
- ☆90Updated 3 months ago
- An AI agent to control drones☆112Updated this week
- KTransformers one-click deployment script☆47Updated 2 months ago
- LM inference server implementation based on *.cpp.☆226Updated this week
- A small language model supporting Chinese-language scenarios: llama2.c-zh☆147Updated last year
- Electronic Parrot / Toy Language Model☆171Updated 2 weeks ago
- Compile & run a single CUDA file on the cloud GPUs☆14Updated 9 months ago
- Supports BM25 + vector☆29Updated last month
- run chatglm3-6b in BM1684X☆39Updated last year
- ☆96Updated last week
- Ollama model registry mirror / accelerator that lets Ollama pull and download models from ModelScope faster.☆95Updated 2 months ago
- GLM Series Edge Models☆142Updated 2 weeks ago
- 360zhinao☆290Updated last month
- Auto Thinking Mode switch for Qwen3 in Open webui☆65Updated last month
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,147Updated last week
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆38Updated 6 months ago
- A pure Markdown document showcase☆26Updated 10 years ago
- Developer-friendly inference code for RVC-Boss/GPT-SoVITS.☆15Updated 8 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆35Updated 2 months ago
- Transformer security-related☆31Updated 8 months ago
- Network packet testing tool☆45Updated last year
- hachimi: A Distributed Honeypot System☆180Updated 5 months ago
- Uses free LLM APIs together with your private-domain data to generate SFT training data at no cost; supports the training data formats of tools such as llamafactory. Synthetic data.☆168Updated 7 months ago
- ☆133Updated 4 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆256Updated 3 weeks ago
- Baidu QA dataset with 1 million entries☆47Updated last year