laekov / fastmoe
View external linksLinks

A fast MoE impl for PyTorch

☆1,831

Alternatives and similar repositories for fastmoe

Users that are interested in fastmoe are comparing it to the libraries listed below

Sorting:

microsoft / Tutel
View on GitHub
Tutel MoE: Optimized Mixture-of-Experts Library, Support GptOss/DeepSeek/Kimi-K2/Qwen3 using FP8/NVFP4/MXFP4
☆963Dec 21, 2025Updated last month
davidmrau / mixture-of-experts
View on GitHub
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
☆1,225Apr 19, 2024Updated last year
XueFuzhao / OpenMoE
View on GitHub
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,657Mar 8, 2024Updated last year
google-research / vmoe
View on GitHub
☆705Dec 6, 2025Updated 2 months ago
thu-pacman / FasterMoE
View on GitHub
☆89Apr 2, 2022Updated 3 years ago
facebookresearch / fairscale
View on GitHub
PyTorch extensions for high performance and large scale training.
☆3,397Apr 26, 2025Updated 9 months ago
NVIDIA / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆15,162Updated this week
XueFuzhao / awesome-mixture-of-experts
View on GitHub
A collection of AWESOME things about mixture-of-experts
☆1,262Dec 8, 2024Updated last year
lucidrains / mixture-of-experts
View on GitHub
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
☆848Sep 13, 2023Updated 2 years ago
deepspeedai / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆2,224Aug 14, 2025Updated 6 months ago
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,392Mar 27, 2024Updated last year
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆22,231Updated this week
codecaution / Awesome-Mixture-of-Experts-Papers
View on GitHub
A curated reading list of research in Mixture-of-Experts(MoE).
☆660Oct 30, 2024Updated last year
bytedance / lightseq
View on GitHub
LightSeq: A High Performance Library for Sequence Processing and Generation
☆3,304May 16, 2023Updated 2 years ago
thu-pacman / SmartMoE-AE
View on GitHub
ATC23 AE
☆46May 11, 2023Updated 2 years ago
Tencent / TurboTransformers
View on GitHub
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
☆1,542Jul 18, 2025Updated 6 months ago
flexflow / flexflow-train
View on GitHub
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
☆1,859Feb 7, 2026Updated last week
pjlab-sys4nlp / llama-moe
View on GitHub
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
☆1,003Dec 6, 2024Updated last year
Harry-Chen / InfMoE
View on GitHub
Inference framework for MoE layers based on TensorRT with Python binding
☆41May 31, 2021Updated 4 years ago
deepspeedai / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆41,578Feb 7, 2026Updated last week
NVIDIA / TransformerEngine
View on GitHub
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,152Feb 7, 2026Updated last week
bigscience-workshop / Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆1,433Mar 20, 2024Updated last year
alpa-projects / alpa
View on GitHub
Training and serving large-scale neural networks with auto parallelization.
☆3,180Dec 9, 2023Updated 2 years ago
shawntan / scattermoe
View on GitHub
Triton-based implementation of Sparse Mixture of Experts.
☆265Oct 3, 2025Updated 4 months ago
zms1999 / SmartMoE
View on GitHub
A MoE impl for PyTorch, [ATC'23] SmartMoE
☆71Jul 11, 2023Updated 2 years ago
haoliuhl / ringattention
View on GitHub
Large Context Attention
☆766Oct 13, 2025Updated 4 months ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,021Jan 23, 2026Updated 3 weeks ago
microsoft / torchscale
View on GitHub
Foundation Architecture for (M)LLMs
☆3,130Apr 11, 2024Updated last year
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
☆8,989Feb 6, 2026Updated last week
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆4,935Updated this week
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆18,387Updated this week
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,705Jun 25, 2024Updated last year
deepseek-ai / DeepSeek-MoE
View on GitHub
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
☆1,894Jan 16, 2024Updated 2 years ago
bytedance / byteps
View on GitHub
A high performance and generic framework for distributed DNN training
☆3,717Oct 3, 2023Updated 2 years ago
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆4,701Updated this week
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,155Jan 22, 2024Updated 2 years ago
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,143Sep 30, 2025Updated 4 months ago
verl-project / verl
View on GitHub
verl: Volcano Engine Reinforcement Learning for LLMs
☆19,132Updated this week
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations of state-of-the-art linear attention models
☆4,379Updated this week

laekov / fastmoeView external linksLinks

Alternatives and similar repositories for fastmoe

laekov / fastmoe
View external linksLinks