promoe-opensource/promoe

☆20

Alternatives and similar repositories for promoe

Users that are interested in promoe are comparing it to the libraries listed below.

IntelliSys-Lab / FineMoE-EuroSys26
☆15 · Sep 25, 2025 · Updated 10 months ago
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13 · Mar 7, 2024 · Updated 2 years ago
EfficientMoE / MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models.
☆321 · Updated this week
hyungyokim / LIA_AMXGPU
[ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading
☆13 · Jun 28, 2025 · Updated last year
PKU-SEC-Lab / HybriMoE
[DAC'25] Official implementation of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"
☆118 · Dec 15, 2025 · Updated 7 months ago
VITA-Group / Q-Hitter
☆15 · Jun 4, 2024 · Updated 2 years ago
caoshiyi / artifacts
☆40 · Nov 28, 2024 · Updated last year
PKU-SEC-Lab / AdapMoE
Code release for AdapMoE accepted by ICCAD 2024
☆39 · Apr 28, 2025 · Updated last year
gajagajago / deepshare
Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23)
☆20 · Jul 8, 2025 · Updated last year
mlsys-io / helium_demo
☆23 · May 2, 2026 · Updated 2 months ago
MincYu / gillis-open-source
☆27 · May 31, 2023 · Updated 3 years ago
UNITES-Lab / Occult
[ICML'25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13 · Apr 17, 2025 · Updated last year
LoongServe / LoongServe
☆135 · Nov 11, 2024 · Updated last year
goliaro / specinfer-ae
☆28 · Mar 14, 2024 · Updated 2 years ago
SJTU-IPADS / PipeLLM
☆28 · Dec 22, 2024 · Updated last year
HanLi05869 / JLU-SNL-COMPILER
This is a JLU-SNL-COMPILER project
☆13 · Oct 6, 2023 · Updated 2 years ago
litchi-lee / NJU_2024_Dissys_Final
Nanjing University 2024 graduate fall semester Distributed Systems final exam review
☆36 · Jun 26, 2026 · Updated 3 weeks ago
kungfu-team / tenplex
Dynamic resource changes for multi-dimensional parallelism training
☆31 · Aug 22, 2025 · Updated 11 months ago
splatlab / vqf
A fast approximate membership query data structure
☆12 · Jul 16, 2024 · Updated 2 years ago
Mr-Linus / NodeSimulator
NodeSimulator can simulate node resources and state in Kubernetes, as well as the state of pods.
☆11 · Nov 7, 2021 · Updated 4 years ago
aleasimulator / alea
Advanced job scheduling simulator
☆18 · Nov 6, 2023 · Updated 2 years ago
eth-easl / dirigent
Dirigent: Lightweight Serverless Orchestration
☆44 · Aug 26, 2025 · Updated 10 months ago
Yu-Maryland / RESPECT
RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23)
☆11 · Apr 13, 2023 · Updated 3 years ago
sands-lab / FOCUS
[ICML 2026] Official implementation of "FOCUS: DLLMs Know How to Tame Their Compute Bound".
☆17 · May 5, 2026 · Updated 2 months ago
Sys-Inventor-Lab / AI4System-OSML
☆14 · Feb 26, 2026 · Updated 4 months ago
HiEST / gpu-topo-aware
GPU topology-aware scheduler
☆13 · Jul 7, 2017 · Updated 9 years ago
kingsum / PIPA
Platform Integrated Performance Analytics (枇杷/琵琶)
☆12 · Jul 15, 2022 · Updated 4 years ago
pengyuzhang / FreeRider
☆17 · Sep 28, 2017 · Updated 8 years ago
UNITES-Lab / HEXA-MoE
Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"
☆15 · Mar 6, 2025 · Updated last year
rssys / pronghorn-artifact
This artifact accompanies the paper 'Pronghorn: Effective Checkpoint Orchestration for Serverless Hot-Starts,' which has been accepted fo…
☆22 · Nov 8, 2023 · Updated 2 years ago
yhliu918 / Learn-to-Compress
The source code for paper LeCo: Lightweight Compression via Learning Serial Correlations (SIGMOD'24).
☆17 · Mar 26, 2024 · Updated 2 years ago
efeslab / fiddler
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
☆267 · Nov 18, 2024 · Updated last year
SJTU-IPADS / PhoenixOS-Remoting
☆21 · Jul 10, 2025 · Updated last year
xiaoleeza / English-for-post-graduate
Graduate English Comprehensive Course: original texts + translations
☆10 · Mar 24, 2017 · Updated 9 years ago
SJTU-IPADS / fgnn-artifacts
FGNN's artifact evaluation (EuroSys 2022)
☆18 · Apr 25, 2022 · Updated 4 years ago
IntelliSys-Lab / RainbowCake-ASPLOS24
☆42 · Nov 5, 2023 · Updated 2 years ago
LLMkvsys / rethink-kv-compression
☆24 · Mar 7, 2025 · Updated last year
LukasPfromm / CHIPSIM
A co-simulation framework for chiplet-based systems executing DNN models.
☆17 · Feb 16, 2026 · Updated 5 months ago
thangbk2209 / LSTM_GoogleClusterTraceData
☆12 · Nov 21, 2017 · Updated 8 years ago