Supercomputing-System-AI-Lab/X-MoE

☆28

Alternatives and similar repositories for X-MoE

Users who are interested in X-MoE are comparing it to the libraries listed below.

Supercomputing-System-AI-Lab / MiLo
Code repo for efficient quantized MoE inference with mixture of low-rank compensators
☆39 · Apr 14, 2025 · Updated last year
Supercomputing-System-AI-Lab / VecFlow
☆36 · May 31, 2026 · Updated last month
merthidayetoglu / CommBench
A Micro-benchmarking Tool for HPC Networks
☆37 · Sep 2, 2025 · Updated 10 months ago
yuandong-tian / understanding
Understanding deep networks and large models.
☆30 · Jan 23, 2026 · Updated 5 months ago
mburaksayici / Why
Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.
☆11 · Feb 19, 2023 · Updated 3 years ago
vinbhaskara / AdamS
PyTorch Code for the Paper: "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization" [Bhaskara et al. (2019)]
☆16 · Apr 30, 2026 · Updated 2 months ago
gpu-mode / triton-tutorials
☆16 · May 14, 2025 · Updated last year
David-Li0406 / SMoA
☆15 · Jan 24, 2025 · Updated last year
neilliang90 / Sadam
☆14 · Aug 28, 2019 · Updated 6 years ago
hemingkx / SWIFT
[ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
☆70 · Feb 21, 2025 · Updated last year
aliyun / syccl
☆24 · Sep 10, 2025 · Updated 10 months ago
OFS / ofs-platform-afu-bbb
OFS Platform Components
☆19 · Jul 6, 2026 · Updated 2 weeks ago
Xiaohao-Liu / L-MTP
[NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
☆32 · May 8, 2026 · Updated 2 months ago
Dao-AILab / sonic-moe
Accelerating MoE with IO and Tile-aware Optimizations
☆732 · Jul 4, 2026 · Updated 2 weeks ago
lgxi24 / AdaBlock-dLLM
[ICLR 2026] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
☆15 · Jan 28, 2026 · Updated 5 months ago
nestordemeure / AdaHessianJax
Jax implementation of the AdaHessian optimizer
☆19 · Mar 11, 2021 · Updated 5 years ago
csinva / cookiecutter-ml-research
A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪
☆19 · Apr 9, 2026 · Updated 3 months ago
vickiegpt / wiki
A place to store my knowledge base
☆12 · Apr 27, 2026 · Updated 2 months ago
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆25 · Feb 15, 2024 · Updated 2 years ago
nestordemeure / ManifoldMixup
Manifold-Mixup implementation for fastai V1
☆19 · Oct 1, 2020 · Updated 5 years ago
sjduan / LeHDC
☆16 · Mar 18, 2025 · Updated last year
JL-Cheng / SERE
[ICLR 2026] SERE: Similarity-Based Expert Re-routing for Efficient Batch Decoding in MoE Models
☆18 · Feb 4, 2026 · Updated 5 months ago
astra-sim / tacos
TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning
☆37 · Jun 13, 2025 · Updated last year
libozhu03 / RIFLE
☆15 · Oct 17, 2025 · Updated 9 months ago
plctlab / riscv-cluster
Towards a million-node RISC-V cluster.
☆14 · Mar 6, 2025 · Updated last year
hgyhungry / alcop-artifact
☆25 · Mar 15, 2023 · Updated 3 years ago
NVlabs / SRSA
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
☆19 · Mar 25, 2026 · Updated 3 months ago
slm-mux / SLM-MUX
☆25 · Mar 26, 2026 · Updated 3 months ago
neelsomani / kv-marketplace
Cross-GPU KV Cache Marketplace
☆26 · Nov 12, 2025 · Updated 8 months ago
zhaochenyang20 / sglang-diffusion-routing
A demonstrative example of running SGLang Diffusion with DP router
☆17 · Mar 15, 2026 · Updated 4 months ago
saeziae / aquote
Quotations from members of the Xinchuang (信创) chat group
☆13 · Nov 5, 2022 · Updated 3 years ago
skylight-org / sparse-attention-hub
Advancing the frontier of efficient AI
☆66 · Jul 10, 2026 · Updated last week
AAzdi / Sparse-BitNet
☆15 · Mar 10, 2026 · Updated 4 months ago
lmgame-org / GRL
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
☆65 · Dec 18, 2025 · Updated 7 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]
☆30 · Feb 20, 2026 · Updated 5 months ago
HLC-Lab / pico
PICO: Performance Insights for Collective Operations
☆19 · Jul 7, 2026 · Updated 2 weeks ago
MYMY-young / DelimScaling
[ICLR 2026] Official implementation of "Enhancing Multi-Image Understanding Through Delimiter Token Scaling"
☆15 · Jul 10, 2026 · Updated last week
mlvlab / CAF
Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024
☆35 · Oct 31, 2025 · Updated 8 months ago
ByteDance-Seed / StragglerAnalysis
☆56 · Apr 30, 2025 · Updated last year