JieShibo/MoLE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JieShibo/MoLE)

JieShibo / MoLE

[ICML 2025 Oral] Mixture of Lookup Experts

☆78

Alternatives and similar repositories for MoLE

Users that are interested in MoLE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
Bigyehahaha / M4
View on GitHub
The code of 《M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis》
☆14Mar 31, 2025Updated last year
Taishi-N324 / Drop-Upcycling
View on GitHub
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
☆25Oct 5, 2025Updated 9 months ago
ysngki / UMoE
View on GitHub
☆23Oct 22, 2025Updated 9 months ago
UCI-CORSA / TeLLMe_FPGA_2026
View on GitHub
TeLLMe: An Efficient End-to-End Ternary LLM Prefill and Decode Accelerator with Table-Lookup Matmul on Edge FPGAs [FPGA2026]
☆32Mar 11, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
JarvisPei / CMoE
View on GitHub
[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
☆46Jun 30, 2026Updated 3 weeks ago
ChenZiHong-Gavin / MoE-Visualizer
View on GitHub
MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.
☆16Apr 8, 2025Updated last year
cmu-flame / FLAME-MoE
View on GitHub
Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
☆42Sep 19, 2025Updated 10 months ago
JieShibo / MemVP
View on GitHub
[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
☆49May 12, 2024Updated 2 years ago
yynil / RWKVInside
View on GitHub
☆41Apr 30, 2025Updated last year
nanowell / Q-Sparse-LLM
View on GitHub
My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆37Aug 14, 2024Updated last year
wangyccn / CR-AI-V1.5
View on GitHub
CRAI is a multimodal large language model based on the Mixture of Experts (MoE) architecture, supporting text and image cross-modal tasks…
☆16Apr 29, 2025Updated last year
VITA-Group / Linearity-Grafting
View on GitHub
[ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…
☆16Jun 22, 2022Updated 4 years ago
iankur / vqllm
View on GitHub
Residual vector quantization for KV cache compression in large language model
☆12Oct 22, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Kaffaljidhmah2 / SpecDec_pp
View on GitHub
Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
☆19Jul 10, 2025Updated last year
CZhongnan / Pan-LUT
View on GitHub
☆15Apr 13, 2026Updated 3 months ago
NJUNLP / MoE-LPR
View on GitHub
☆22Dec 11, 2024Updated last year
cmzsbql / SDformer
View on GitHub
[NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"
☆17May 23, 2025Updated last year
Alex-HaochenLi / Soft-InfoNCE
View on GitHub
[EMNLP'23] Code for 'Rethinking Negative Pairs in Code Search'
☆16Oct 17, 2023Updated 2 years ago
littleSunlxy / FedPU-torch
View on GitHub
Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data
☆10Aug 12, 2022Updated 3 years ago
pablodawson / ldm3d-inpainting
View on GitHub
Joint image and Depth inpainting, ldm3d
☆16Apr 28, 2024Updated 2 years ago
Aaronhuang-778 / Mixture-Compressor-MoE
View on GitHub
[ICLR 2025, IEEE TPAMI 2026] Mixture Compressor & MC#
☆75Feb 12, 2025Updated last year
wrmedford / moe-scaling
View on GitHub
Scaling Laws for Mixture of Experts Models
☆15Feb 25, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kyegomez / MHMoE
View on GitHub
Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆30Updated this week
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
zdebruine / MMVAE
View on GitHub
Mixture-of-Experts Multimodal Variational Autoencoder
☆15Jul 3, 2025Updated last year
ZhouYuxuanYX / Maximum-Suppression-Regularization
View on GitHub
This is the official repository of our NeurIPS 2025 paper "MaxSup: Overcoming Representation Collapse in Label Smoothing"
☆22Nov 6, 2025Updated 8 months ago
caxenie / fuzzy-ctrl-demo
View on GitHub
Trajectory tracking control for wheeled mobile robots in a robot soccer field using Fuzzy Logic.
☆12Jul 18, 2021Updated 5 years ago
core-mm / core-mm
View on GitHub
☆17Feb 22, 2024Updated 2 years ago
lutnn / blink-mm
View on GitHub
☆16Jul 24, 2023Updated 3 years ago
Haoqing-Wang / CPNWCP
View on GitHub
[ECCV 2022] Contrastive Prototypical Network with Wasserstein Confidence Penalty
☆11Oct 20, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
thu-ml / ReMoE
View on GitHub
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆118Dec 20, 2024Updated last year
zzzx1224 / EBTSA-ICLR2023
View on GitHub
☆12Feb 17, 2025Updated last year
Alex-HaochenLi / RACS
View on GitHub
[EMNLP'22] Code for 'Exploring Representation-level Augmentation for Code Search'
☆28Oct 9, 2023Updated 2 years ago
zjnyly / TeraFly
View on GitHub
[DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs
☆38Nov 13, 2025Updated 8 months ago
zichongli5 / NorMuon
View on GitHub
Official Implementation for NorMuon paper
☆84Apr 30, 2026Updated 2 months ago
IntelliSys-Lab / FineMoE-EuroSys26
View on GitHub
☆15Sep 25, 2025Updated 10 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year