inclusionAI / MoBELinks
Mixture-of-Basis-Experts for Compressing MoE-based LLMs
☆24Updated last week
Alternatives and similar repositories for MoBE
Users that are interested in MoBE are comparing it to the libraries listed below
Sorting:
- A comprehensive and efficient long-context model evaluation framework☆28Updated this week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆55Updated 3 weeks ago
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated last year
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆16Updated last month
- FuseAI Project☆88Updated 11 months ago
- ☆14Updated 11 months ago
- Official Implementation of APB (ACL 2025 main Oral)☆32Updated 10 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆12Updated 3 months ago
- ☆16Updated last year
- ☆21Updated 3 weeks ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆23Updated 2 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆18Updated 11 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 4 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Updated 6 months ago
- ☆19Updated 11 months ago
- ☆52Updated 5 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆26Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- ☆46Updated 3 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆33Updated 7 months ago
- ☆17Updated 5 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆21Updated last year
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆33Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Updated 10 months ago
- DCPO: Dynamic Adaptive Clipping for RL☆45Updated last week
- ☆20Updated 9 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆46Updated 3 months ago
- ☆36Updated 5 months ago