gabrielolympie / moe-prunerView external linksLinks
A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size
☆83Sep 5, 2025Updated 5 months ago
Alternatives and similar repositories for moe-pruner
Users that are interested in moe-pruner are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆34Mar 6, 2025Updated 11 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated last year
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated 10 months ago
- ☆41Apr 30, 2025Updated 9 months ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- Telegram bot which can work with both openAI and LocalAI modes, it also uses UncensoredGPT models like Wizard-Uncensored. It can be launc…☆18Mar 14, 2025Updated 11 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- ☆27Aug 27, 2025Updated 5 months ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆24Jun 26, 2024Updated last year
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression☆72Mar 25, 2025Updated 10 months ago
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆31Jan 28, 2026Updated 2 weeks ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆36Aug 27, 2025Updated 5 months ago
- ☆53Feb 11, 2025Updated last year
- Flash-Muon: An Efficient Implementation of Muon Optimizer☆235Jun 15, 2025Updated 8 months ago
- ☆63Oct 17, 2023Updated 2 years ago
- Large language models for document ranking.☆71Jan 13, 2026Updated last month
- ☆119Jan 8, 2026Updated last month
- Synthetic Text Dataset Generation for LLM projects☆55Nov 28, 2025Updated 2 months ago
- RWKV centralised docs for the community☆31Jan 17, 2026Updated 3 weeks ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆18Nov 18, 2024Updated last year
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆37Dec 15, 2022Updated 3 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 9 months ago
- 추천시스템 주요 논문 리뷰 및 구현☆28Jan 6, 2024Updated 2 years ago
- This is the official repo for the paper "LLM-FE"☆55Feb 3, 2026Updated last week
- Lottery Ticket Adaptation☆40Nov 20, 2024Updated last year
- fine-tuning tutorial☆17Dec 13, 2025Updated 2 months ago
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- DOS Program Development☆12Nov 9, 2022Updated 3 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆91Feb 14, 2025Updated last year
- ☆41Sep 9, 2025Updated 5 months ago
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆30Mar 28, 2024Updated last year
- ☆31Mar 13, 2024Updated last year
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆218Nov 27, 2025Updated 2 months ago