A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size
☆87Sep 5, 2025Updated 8 months ago
Alternatives and similar repositories for moe-pruner
Users that are interested in moe-pruner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple Fast API Backend for Ironclad/rivet☆26Jan 9, 2024Updated 2 years ago
- ☆12Dec 21, 2024Updated last year
- This is an Android App. Now with 100% less bugs.☆10Sep 26, 2019Updated 6 years ago
- ROSA-Tuning☆73Feb 4, 2026Updated 3 months ago
- ☆17Jan 1, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆81Mar 25, 2025Updated last year
- ☆28Aug 27, 2025Updated 8 months ago
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 11 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago
- RWKV centralised docs for the community☆33Jan 17, 2026Updated 4 months ago
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Telegram bot which can work with both openAI and LocalAI modes, it also uses UncensoredGPT models like Wizard-Uncensored. It can be launc…☆23Mar 14, 2025Updated last year
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Compare openresty vs nginx + PUC_lua☆18Nov 3, 2023Updated 2 years ago
- Language modeling with linear-cost context☆118Sep 25, 2025Updated 8 months ago
- Demonstration of a factory pattern where the types automatically register themselves☆13Mar 13, 2019Updated 7 years ago
- ☆19Sep 29, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models