thu-pacman/SmartMoE-AE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thu-pacman/SmartMoE-AE)

thu-pacman / SmartMoE-AE

ATC23 AE

☆45

Alternatives and similar repositories for SmartMoE-AE

Users that are interested in SmartMoE-AE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zms1999 / SmartMoE
View on GitHub
A MoE impl for PyTorch, [ATC'23] SmartMoE
☆73Jul 11, 2023Updated 3 years ago
thu-pacman / lab-guide
View on GitHub
Everything about PACMAN!
☆19May 28, 2026Updated last month
YJHMITWEB / ExFlow
View on GitHub
Explore Inter-layer Expert Affinity in MoE Model Inference
☆16May 6, 2024Updated 2 years ago
pkusys / ElasticFlow
View on GitHub
Artifacts for our ASPLOS'23 paper ElasticFlow
☆56May 10, 2024Updated 2 years ago
thu-pacman / FasterMoE
View on GitHub
☆92Apr 2, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
UNITES-Lab / Occult
View on GitHub
[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13Apr 17, 2025Updated last year
alpa-projects / mms
View on GitHub
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆94Jul 14, 2023Updated 3 years ago
UMass-LIDS / Proteus
View on GitHub
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Mar 7, 2024Updated 2 years ago
roastduck / FreeTensor
View on GitHub
A language and compiler for irregular tensor programs.
☆152Jul 16, 2026Updated last week
kanonjz / paper
View on GitHub
Machine Learning System
☆14May 11, 2020Updated 6 years ago
ruipeterpan / marconi
View on GitHub
Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]
☆63Mar 5, 2025Updated last year
Raphael-Hao / Abacus
View on GitHub
☆38Jun 27, 2025Updated last year
laekov / panleaf
View on GitHub
Write pandoc markdown in OverLeaf
☆12Sep 28, 2022Updated 3 years ago
cometeme / funcoder
View on GitHub
Implementation for NeurIPS 2024 oral paper: Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation
☆16Jan 27, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
STAR-Laboratory / Accelerating-RecSys-Training
View on GitHub
Accelerating Recommender model training by leveraging popular choices -- VLDB 2022
☆30Sep 15, 2024Updated last year
microsoft / AutoMoE
View on GitHub
AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
☆48Oct 21, 2022Updated 3 years ago
H-Huang / torch_collective_extension
View on GitHub
A minimum demo for PyTorch distributed extension functionality for collectives.
☆15Jul 29, 2024Updated last year
LLMServe / DistServe
View on GitHub
Disaggregated serving system for Large Language Models (LLMs).
☆826Apr 6, 2025Updated last year
yuyangJin / PerFlow-AI
View on GitHub
PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system.
☆33May 12, 2026Updated 2 months ago
abcdabcd987 / libfabric-efa-demo
View on GitHub
☆82Jan 5, 2025Updated last year
S-Lab-System-Group / Lucid
View on GitHub
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆61May 21, 2023Updated 3 years ago
l3lackcurtains / dbscan-kdtree-cuda
View on GitHub
Massively parallel DBSCAN algorithm implemented in CUDA along with a KD-Tree for searching neighbors.
☆13Sep 21, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / ParrotServe
View on GitHub
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
☆223Sep 21, 2024Updated last year
uw-mad-dash / shockwave
View on GitHub
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆46Nov 24, 2022Updated 3 years ago
mlc-ai / mlc-python
View on GitHub
☆36Jul 19, 2025Updated last year
uds-lsv / SIDP
View on GitHub
Robust Differentially Private Training of Deep Neural Networks
☆12Dec 10, 2020Updated 5 years ago
OrderLab / awesome-machine-learning-reliability
View on GitHub
A curated reading list for machine learning reliability research and practice
☆31Sep 18, 2025Updated 10 months ago
thustorage / shiftlock
View on GitHub
[FAST'25] ShiftLock: Mitigate One-sided RDMA Lock Contention via Handover.
☆20Feb 11, 2025Updated last year
microsoft / SuperScaler
View on GitHub
An experimental parallel training platform
☆57Mar 25, 2024Updated 2 years ago
xalanq / ITree
View on GitHub
A Geek TreeView Markdown Editor
☆18Mar 4, 2018Updated 8 years ago
decoding-comp-trust / comp-trust
View on GitHub
Codebase for decoding compressed trust.
☆27May 7, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Sys-Inventor-Lab / AI4System-OSML
View on GitHub
☆14Feb 26, 2026Updated 5 months ago
S-Lab-System-Group / Hydro
View on GitHub
Surrogate-based Hyperparameter Tuning System
☆30Jun 29, 2023Updated 3 years ago
Harry-Chen / InfMoE
View on GitHub
Inference framework for MoE layers based on TensorRT with Python binding
☆40May 31, 2021Updated 5 years ago
Marishwaran99 / Flappy-bird
View on GitHub
☆12Oct 1, 2020Updated 5 years ago
osayamenja / FlashMoE
View on GitHub
Distributed MoE in a Single Kernel [NeurIPS '25]
☆275May 5, 2026Updated 2 months ago
renmengye / np-conv2d
View on GitHub
2D Convolution using NumPy
☆17May 26, 2022Updated 4 years ago
kaiu85 / hm-rnn
View on GitHub
PyTorch Implementation of Hierarchical Multiscale Recurrent Neural Networks
☆15Nov 13, 2018Updated 7 years ago