ubergarm / r1-ktransformers-guide
run DeepSeek-R1 GGUFs on KTransformers
☆224Updated last month
Alternatives and similar repositories for r1-ktransformers-guide:
Users that are interested in r1-ktransformers-guide are comparing it to the libraries listed below
- LM inference server implementation based on *.cpp.☆173Updated this week
- High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.☆1,097Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆243Updated last week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆39Updated last week
- ☆225Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆49Updated 5 months ago
- ROGRAG: A Robustly Optimized GraphRAG Framework☆111Updated this week
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。☆170Updated this week
- Community maintained hardware plugin for vLLM on Ascend☆515Updated this week
- ☆311Updated 4 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆152Updated this week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆834Updated this week
- This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.☆85Updated 3 weeks ago
- ☆243Updated 3 months ago
- ☆269Updated this week
- Mixture-of-Experts (MoE) Language Model☆186Updated 7 months ago
- Build & Optimize your RAG.☆624Updated this week
- Alpaca Chinese Dataset -- 中文指令微调数据集☆199Updated 6 months ago
- ☆140Updated 11 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆37Updated 4 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆230Updated 5 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆61Updated this week
- ☆119Updated last week
- Phi3 中文后训练模型仓库☆321Updated 4 months ago
- GOT-OCR的GUI版本,提供OCR、导出PDF、批处理等功能,但不提供训练功能☆168Updated last month
- 添加🚀流式 Web 服务到 GraphRAG,兼容 OpenAI SDK,支持可访问的实体链接🔗,支持建议问题,兼容本地嵌入模型,修复诸多问题。Add streaming web server to GraphRAG, compatible with OpenAI SD…☆250Updated last month
- Train a 1B LLM with 1T tokens from scratch by personal☆613Updated last month
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化 ,模型拥有1B参数,支持中英文。☆369Updated 2 months ago
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆452Updated last month
- Port of Facebook's LLaMA model in C/C++☆92Updated this week