REAP: Router-weighted Expert Activation Pruning for SMoE compression
☆292Mar 18, 2026Updated this week
Alternatives and similar repositories for reap
Users that are interested in reap are comparing it to the libraries listed below
Sorting:
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 4 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆686Updated this week
- ☆21Apr 2, 2025Updated 11 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆18Feb 10, 2026Updated last month
- Fused Qwen3 MoE layer for faster training, compatible with Transformers, LoRA, bnb 4-bit quant, Unsloth. Also possible to train LoRA over…☆241Feb 19, 2026Updated last month
- Voice Cloning, Now Inside Kokoro. Generate natural multilingual speech and clone any target voice with ease.☆60Mar 11, 2026Updated last week
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 3 weeks ago
- OCTAVE protocol - structured AI communication with 3-20x token reduction. MCP server with lenient-to-canonical pipeline and schema valida…☆41Updated this week
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated 2 months ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- A minimal CLI tool for piping anything into an LLM.☆19Jan 1, 2026Updated 2 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- ☆19Dec 9, 2025Updated 3 months ago
- Long-term Research Assistants with Self-Scheduling☆53Mar 10, 2026Updated last week
- Kernel Library Wheel for SGLang☆16Updated this week
- an autonomous independent digital companion☆14Feb 12, 2026Updated last month
- A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.☆104Jul 9, 2025Updated 8 months ago
- ☆10Mar 8, 2025Updated last year
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆43Jul 9, 2025Updated 8 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 2 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization☆82Feb 20, 2026Updated last month
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated 10 months ago
- ☆13Dec 21, 2024Updated last year
- Official Implementation for NorMuon paper☆61Mar 11, 2026Updated last week
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- A backup of SmokelessRuntimeEFIPatcher☆28Jun 19, 2024Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated last year
- [Mirrored from UPM, not affiliated with Unity Technologies.] 📦 The Terrain Tools package adds additional Terrain sculpting brushes and t…☆16Feb 25, 2026Updated 3 weeks ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- 🔍📃 LLM-powered PDF Table Extractor☆19Jun 26, 2025Updated 8 months ago
- 🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantiza…☆883Mar 13, 2026Updated last week
- JotItNow is a AI Voice Notes App☆25Mar 6, 2025Updated last year
- 这是仿炉石传说的一个卡牌游戏,客户端是unity3D,服务器是KBEngine☆12Jul 3, 2018Updated 7 years ago
- Training tiny models to prove hard theorems☆59Mar 5, 2026Updated 2 weeks ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆66Aug 3, 2025Updated 7 months ago