ubergarm / r1-ktransformers-guide
run DeepSeek-R1 GGUFs on KTransformers
☆261 · Mar 3, 2025 · Updated 11 months ago
Alternatives and similar repositories for r1-ktransformers-guide
Users that are interested in r1-ktransformers-guide are comparing it to the libraries listed below
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations ☆16,501 · Feb 7, 2026 · Updated last week
- ☆57 · Feb 10, 2025 · Updated last year
- llama.cpp fork with additional SOTA quants and improved performance ☆1,605 · Updated this week
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆25 · Dec 20, 2024 · Updated last year
- CPU inference for the DeepSeek family of large language models in C++ ☆315 · Oct 2, 2025 · Updated 4 months ago
- A pure C++ cross-platform LLM acceleration library, callable from Python, supporting ChatGLM-6B, LLaMA, Baichuan, and MOSS base models on x86 / ARM ☆12 · Jan 30, 2026 · Updated 2 weeks ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp ☆16 · Updated this week
- A simple no-install web UI for Ollama and OAI-Compatible APIs! ☆31 · Jan 30, 2025 · Updated last year
- d.run website ☆15 · Updated this week
- Yet Another (LLM) Web UI, made with Gemini ☆12 · Dec 25, 2024 · Updated last year
- AI Based "Happiness Optimizer" ☆12 · Oct 20, 2024 · Updated last year
- AI Demo project: a collection of hands-on examples for developers who want to learn and explore artificial intelligence (AI) technology. ☆25 · Jan 3, 2026 · Updated last month
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- A chess arena for large language models ☆38 · May 22, 2025 · Updated 8 months ago
- Copy the MLP of llama3 8 times as 8 experts, create a router with random initialization, add load balancing loss to construct an 8x8b Mo… ☆27 · Jul 1, 2024 · Updated last year
- Unofficial quantized models for flux1 ☆12 · Aug 14, 2024 · Updated last year
- LLM-related stuff, including code and docs ☆13 · Feb 25, 2025 · Updated 11 months ago
- Run LLaMA-3 70B LLM with NVIDIA endpoints ☆14 · Apr 20, 2024 · Updated last year
- Pre-training llama3 using Chinese ☆13 · May 1, 2024 · Updated last year
- Rancher Prime GC Catalog ☆13 · Jan 22, 2026 · Updated 3 weeks ago
- Smart Chinese LLM ☆19 · Jan 31, 2024 · Updated 2 years ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 · Jun 16, 2024 · Updated last year
- ☆14 · Dec 6, 2023 · Updated 2 years ago
- LLM backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- Awesome LLM speech-to-speech models and frameworks ☆39 · Nov 17, 2025 · Updated 2 months ago
- An implementation of the Tsetlin Machine in Rust ☆16 · Apr 15, 2018 · Updated 7 years ago
- "Make-A-Video", new SOTA text to video by Meta-FAIR - Tensorflow ☆14 · Oct 22, 2022 · Updated 3 years ago
- ☆20 · Jun 28, 2025 · Updated 7 months ago
- JotItNow is an AI Voice Notes App ☆24 · Mar 6, 2025 · Updated 11 months ago
- Zero-Shot Summarization with GPT-3 ☆16 · Sep 11, 2023 · Updated 2 years ago
- ☆18 · Apr 18, 2025 · Updated 9 months ago
- llama.cpp fork with additional SOTA quants and improved performance ☆21 · Feb 7, 2026 · Updated last week
- Context-aware LLM Translator (CALT) ☆48 · Jan 8, 2025 · Updated last year
- ☆11 · Feb 6, 2026 · Updated last week
- A hybrid-inference extension plugin for vLLM that supports multi-NUMA hybrid inference; single-GPU inference of the Qwen3-Next model can reach 1000+ prefill ☆31 · Nov 7, 2025 · Updated 3 months ago
- ☆17 · Dec 16, 2024 · Updated last year
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha… ☆76 · Updated this week
- DPDK-based packet capture tool ☆17 · Mar 2, 2017 · Updated 8 years ago
- ☆82 · Nov 11, 2024 · Updated last year