UNITES-Lab/MoE-RBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UNITES-Lab/MoE-RBench)

UNITES-Lab / MoE-RBench

[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"

☆11

Alternatives and similar repositories for MoE-RBench

Users that are interested in MoE-RBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Linzwcs / AutoMusicTheoryQA
View on GitHub
☆22Nov 21, 2025Updated 8 months ago
UNITES-Lab / Mew
View on GitHub
[ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"
☆17Jul 27, 2024Updated last year
UNITES-Lab / flash-molecular-dynamics
View on GitHub
Fast and accurate coarse-grained neural network molecular dynamics
☆15Jul 13, 2026Updated last week
Spico197 / MoE-SFT
View on GitHub
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
☆41Sep 29, 2024Updated last year
ysngki / XMoE
View on GitHub
☆15Oct 19, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
UNITES-Lab / MoE-Quantization
View on GitHub
Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"
☆31Jun 30, 2025Updated last year
OpenSparseLLMs / Open-Pandora
View on GitHub
Open-Pandora: On-the-fly Control Video Generation
☆35Nov 28, 2024Updated last year
UNITES-Lab / CryoNeRF
View on GitHub
☆23May 2, 2025Updated last year
MajorDavidZhang / MCL
View on GitHub
code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
☆20Jul 16, 2024Updated 2 years ago
UNITES-Lab / C2R-MoE
View on GitHub
[NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…
☆16Feb 4, 2025Updated last year
zhaochen0110 / Timo
View on GitHub
Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)
☆26Oct 23, 2024Updated last year
brightjade / SimCKP
View on GitHub
Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023
☆12Jun 20, 2025Updated last year
kaist-silab / meta-sage
View on GitHub
[ICML 2023] Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optim…
☆10Dec 19, 2023Updated 2 years ago
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
camenduru / FoleyCrafter-jupyter
View on GitHub
☆10Jun 28, 2024Updated 2 years ago
sunnweiwei / AmbigPrompt
View on GitHub
Answering Ambiguous Questions via Iterative Prompting
☆14May 25, 2024Updated 2 years ago
OpenSparseLLMs / Skip-DiT
View on GitHub
✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆80Jul 10, 2025Updated last year
wmt-conference / wmt23-news-systems
View on GitHub
☆14Oct 6, 2025Updated 9 months ago
r-three / smear
View on GitHub
☆30Sep 28, 2023Updated 2 years ago
peijunallin / alphalora
View on GitHub
☆19Nov 10, 2024Updated last year
ibraheem-moosa / mt-ranker
View on GitHub
Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking
☆10Feb 29, 2024Updated 2 years ago
OSU-BMBL / scGNN2.0
View on GitHub
☆12Dec 4, 2023Updated 2 years ago
wenzhe-li / Self-MoA
View on GitHub
☆17Feb 4, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
TIGER-AI-Lab / PixelWorld
View on GitHub
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15Sep 12, 2025Updated 10 months ago
zhaochen0110 / Cotempqa
View on GitHub
Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)
☆31Jul 3, 2024Updated 2 years ago
LLM360 / k2-data-prep
View on GitHub
☆21Jun 4, 2024Updated 2 years ago
ictnlp / NMLA-NAT
View on GitHub
Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"
☆20Nov 16, 2022Updated 3 years ago
Huntersxsx / SJTU_2020_Spring
View on GitHub
上海交通大学2020春研究生的部分课程作业整理
☆16Jun 14, 2020Updated 6 years ago
Junseok0207 / scFP
View on GitHub
The official source code for "Single-cell RNA-seq data imputation using Feature Propagation", accepted at 2023 ICML Workshop on Computati…
☆12Aug 31, 2023Updated 2 years ago
XMUDeepLIT / DAMAML
View on GitHub
Code for "Domain Adaptive Meta-learning for Dialogue State Tracking"(TASLP2021)
☆10Sep 14, 2021Updated 4 years ago
daehoum1 / pcfi
View on GitHub
Confidence-Based Feature Imputation for Graphs with Partially Known Features (ICLR 2023)
☆11Sep 20, 2025Updated 10 months ago
JongSuk1 / AVCap
View on GitHub
☆11Sep 1, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ursulalujun / UESTCnote
View on GitHub
UESTC 2020级在读本科生，整理了一些学习笔记，希望能够帮助到学弟学妹们❤
☆14Sep 18, 2023Updated 2 years ago
AI45Lab / IS-Bench
View on GitHub
[AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
☆47Nov 24, 2025Updated 8 months ago
XMUDeepLIT / ABDNMT-RNMT
View on GitHub
Code for "Exploiting reverse target-side contexts for neural machine translation via asynchronous bidirectional decoding" (Artificial Int…
☆11Dec 27, 2022Updated 3 years ago
hpcaitech / Elixir
View on GitHub
Elixir: Train a Large Language Model on a Small GPU Cluster
☆16Jun 8, 2023Updated 3 years ago
G-AILab / IGRM
View on GitHub
☆16Nov 15, 2023Updated 2 years ago
quanshr / DMoERM
View on GitHub
[ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
☆17Jun 6, 2024Updated 2 years ago
SukwonYun / S-Mixup
View on GitHub
[CIKM 2023 Short] Code for the paper "S-Mixup: Structural Mixup for Graph Neural Networks"
☆18Aug 21, 2023Updated 2 years ago