REAP: Router-weighted Expert Activation Pruning for SMoE compression
☆358Apr 17, 2026Updated last month
Alternatives and similar repositories for reap
Users that are interested in reap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆34May 2, 2026Updated 2 weeks ago
- Train and run transformers directly on Apple's Neural Engine in Swift bypass coreml entirely☆108Apr 18, 2026Updated last month
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆878Updated this week
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆56Mar 16, 2026Updated 2 months ago
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆93Mar 6, 2026Updated 2 months ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 10 months ago
- 电报注册教程:2025年国内手机号注册Telegram收不到验证码怎么办?【已解决】本文还将会详细介绍如何下载、安装、注册和使用Telegram电报,并会为大家提供在中国使用Telegram电报的一些实用建议☆19Apr 9, 2025Updated last year
- This is a custom node for n8n. It allows you to execute PowerShell commands within an n8n workflow.☆14May 16, 2023Updated 3 years ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆23Nov 11, 2025Updated 6 months ago
- Model souping for LLMs☆73Nov 18, 2025Updated 6 months ago
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated 4 months ago
- A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support…☆1,394May 14, 2026Updated last week
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆82Mar 22, 2026Updated last month
- A minimal CLI tool for piping anything into an LLM.☆21Jan 1, 2026Updated 4 months ago
- ☆21Dec 9, 2025Updated 5 months ago
- Use winsqlite3.dll (the SQLite DLL that ships with Windows 10) in PowerShell☆13Jan 12, 2025Updated last year
- RADLADS training code☆42May 7, 2025Updated last year
- OCTAVE protocol - structured AI communication with 3-20x token reduction. MCP server with lenient-to-canonical pipeline and schema valida…☆50May 13, 2026Updated last week
- ☆19Jul 4, 2025Updated 10 months ago
- an autonomous independent digital companion☆14Feb 12, 2026Updated 3 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Qwen3-Coder-Next optimization to run on 8+ Gb VRAM!!!☆38Mar 5, 2026Updated 2 months ago
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆44Jul 9, 2025Updated 10 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 4 months ago
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs☆48Apr 24, 2026Updated 3 weeks ago
- The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"☆108Updated this week
- ☆114Jun 19, 2025Updated 11 months ago
- The Active Reliability Layer for AI Agents. Catch failures, teach fixes, and automate reliability☆132Jan 19, 2026Updated 4 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆38Feb 25, 2026Updated 2 months ago
- Official Implementation for NorMuon paper☆70Apr 30, 2026Updated 2 weeks ago
- Your AI Soul Companion. Self-hosted AI agent across 30+ messaging channels It can not only serve as an emotional companion in daily life …☆45Updated this week
- Memory Agent monorepo☆87Oct 9, 2025Updated 7 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Apr 16, 2026Updated last month