vproxy-tools / ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆39Updated last week
Alternatives and similar repositories for ktransformers:
Users that are interested in ktransformers are comparing it to the libraries listed below
- Efficient inference of large language models.☆146Updated 4 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models forked from deepseek-ai/Janus☆17Updated 2 months ago
- run DeepSeek-R1 GGUFs on KTransformers☆224Updated last month
- ☆15Updated last month
- ☆85Updated last month
- ai法律团队☆41Updated 4 months ago
- 爬取知乎问题“什么时候你真的很心疼一只猫?”问题下的回答(截至2022.4.18)x-zse-96逆向分析☆12Updated 3 years ago
- 支持中文场景的的小语言模型 llama2.c-zh☆144Updated last year
- 哈基米 一个分布式蜜网系统 | hachimi A Distributed Honeypot System☆177Updated 3 months ago
- ☆12Updated 3 years ago
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.☆9Updated 2 years ago
- support BM25+vecetor☆26Updated 5 months ago
- AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识☆122Updated this week
- 大模型中文测试题库-民间版本☆78Updated last year
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆243Updated last week
- CPU inference for the DeepSeek family of large language models in C++☆288Updated this week
- 电子鹦鹉 / Toy Language Model☆163Updated 2 weeks ago
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,097Updated this week
- run chatglm3-6b in BM1684X☆38Updated last year
- 百度QA100万数据集☆47Updated last year
- ktransformers v0.3 docker build and run☆12Updated 2 months ago
- LM inference server implementation based on *.cpp.☆173Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆11Updated 9 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆67Updated last week
- 一个手把手教你从零开始编写GPT并训练大语言模型的教程☆71Updated 3 months ago
- MoLing is a computer-use and browser-use based MCP server. It is a locally deployed, dependency-free office AI assistant.☆267Updated this week
- GLM Series Edge Models☆136Updated 2 months ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆52Updated last year
- 一个模块化,全过程可离线,低占用率的对话机器人/智能音箱☆72Updated last month
- ☆129Updated 2 months ago