inclusionAI/MoBE

Mixture-of-Basis-Experts for Compressing MoE-based LLMs

☆34

Alternatives and similar repositories for MoBE

Users who are interested in MoBE are comparing it to the libraries listed below.

RUCKBReasoning / LLM-Streamline
Official implementation of the ICLR paper "Streamlining Redundant Layers to Compress Large Language Models"
☆43 · May 1, 2025 · Updated last year
lyj20071013 / Triton-FlashAttention
This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance im…
☆11 · Mar 26, 2026 · Updated last month
ZIB-IOL / SMS
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12 · Oct 14, 2025 · Updated 7 months ago
BaiTheBest / SRDML
GitHub Repository for KDD 2022 paper "Saliency-Regularized Deep Multi-Task Learning"
☆12 · Sep 26, 2023 · Updated 2 years ago
colehawkins / bayesian-tensor-rank-determination
☆13 · Dec 17, 2021 · Updated 4 years ago
NJUNLP / Hallu-PI
The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed …
☆11 · Sep 27, 2024 · Updated last year
toshas / sttp
Spectral Tensor Train Parameterization of Deep Learning Layers
☆17 · Jul 1, 2021 · Updated 4 years ago
WenyiWU0111 / CoMEM
This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.
☆26 · Jul 3, 2025 · Updated 10 months ago
eth-lre / PedagogicalRL
Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral
☆36 · Dec 11, 2025 · Updated 5 months ago
kaiqi123 / SQAKD
☆16 · May 3, 2024 · Updated 2 years ago
aster2024 / SWIFT
Source code for SWIFT, an efficient reward model.
☆21 · Jan 13, 2026 · Updated 4 months ago
GATECH-EIC / LaCache
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆18 · Nov 4, 2025 · Updated 6 months ago
priorelli / dynamic-planning
Dynamic planning, hybrid models, hierarchical active inference, tool use
☆15 · Jun 13, 2025 · Updated 11 months ago
mbedross / MachineLearningObjectTracking
MLOT - A machine learning algorithm, written for use with MATLAB, in order to track in 3D moving particles based on a training data set. …
☆12 · Dec 24, 2018 · Updated 7 years ago
lr8soft / PythonSTG
☆16 · Apr 6, 2023 · Updated 3 years ago
devansh20la / LPF-SGD
☆17 · Dec 11, 2022 · Updated 3 years ago
chanchimin / AgentMonitor
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13 · Dec 13, 2024 · Updated last year
iCAS-SJTU / Shift-Left-EDA-Papers
This GitHub repository summarizes relevant papers on shift-left techniques in electronic design automation (EDA).
☆32 · Sep 19, 2025 · Updated 8 months ago
memirror / magicMirror
Magic mirror, magic mirror, the all-knowing magic mirror [-_-] (not really)
☆13 · Jun 10, 2021 · Updated 4 years ago
Wei-Nijuan / DecisionSpikeFormer
[CVPR 2025] Decision SpikeFormer: Spike-Driven Transformer for Decision Making
☆19 · Aug 8, 2025 · Updated 9 months ago
corl-team / lime
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32 · May 28, 2025 · Updated 11 months ago
NeuroAIHub / NetFormer
☆13 · Nov 18, 2025 · Updated 6 months ago
ziplab / CoV
[ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning
☆60 · Apr 7, 2026 · Updated last month
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Jul 6, 2024 · Updated last year
Infini-AI-Lab / STEM
☆66 · May 7, 2026 · Updated 2 weeks ago
ocademy-ai / open-learning-resources
A curated list of free/open source resources for you to learn Computer Science.
☆19 · Jul 4, 2023 · Updated 2 years ago
LCM-Lab / LOOM-Eval
A comprehensive and efficient long-context model evaluation framework
☆31 · Feb 25, 2026 · Updated 2 months ago
inclusionAI / Ring
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling.
☆109 · Aug 5, 2025 · Updated 9 months ago
vpuhoff / noprop-dt-mnist-pytorch
This repository contains an experimental PyTorch implementation exploring the NoProp algorithm, presented in the paper "NOPROP: TRAINING …
☆16 · May 14, 2026 · Updated last week
JerryYin777 / Cross-Layer-Attention
Self-reproduction code for the paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL)
☆17 · May 24, 2024 · Updated last year
eth-lre / mathtutorbench
Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆35 · Nov 18, 2025 · Updated 6 months ago
NVIDIA-NeMo / ProRL-Agent-Server
Agentic RL on Any Harness at Scale
☆136 · May 15, 2026 · Updated last week
stockeh / mlx-drifting-model
Generative Modeling via Drifting in MLX
☆43 · Feb 6, 2026 · Updated 3 months ago
GuanZhengChen / GGPN
☆10 · Dec 11, 2021 · Updated 4 years ago
SeanPesce / Spade-Web-Viewer
Utility to convert Spade device video streams to MJPEG for live viewing in web browsers, VLC, etc.
☆11 · Nov 20, 2023 · Updated 2 years ago
visjs / vis-util
Helper functions for the visjs family
☆15 · May 15, 2026 · Updated last week
tml-epfl / sam-low-rank-features
Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]
☆29 · Sep 22, 2023 · Updated 2 years ago
Szzx123 / PlatformIoT-Environment-RaspberryPi-Django-AWS-Web
☆20 · Mar 22, 2023 · Updated 3 years ago
VimsLab / MeshNet2
☆23 · Aug 1, 2022 · Updated 3 years ago