Noumena-Network/nmoe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Noumena-Network/nmoe)

Noumena-Network / nmoe

MoE training for Me and You and maybe other people

☆394

Alternatives and similar repositories for nmoe

Users that are interested in nmoe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Noumena-Network / code
View on GitHub
Noumena's internal version of your favorite coding tui but improved and optimized for our inference stack
☆134Jul 2, 2026Updated 2 weeks ago
Noumena-Network / NSA-Test
View on GitHub
NSA Triton Kernels written with GPT5 and Opus 4.1
☆70Aug 12, 2025Updated 11 months ago
Dao-AILab / sonic-moe
View on GitHub
Accelerating MoE with IO and Tile-aware Optimizations
☆732Jul 4, 2026Updated 2 weeks ago
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,698Updated this week
xjdr-alt / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆3,434Nov 13, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
PrimeIntellect-ai / renderers
View on GitHub
Programmable chat templates for LLM training and inference.
☆133Updated this week
microsoft / dion
View on GitHub
Dion optimizer algorithm
☆494Jul 12, 2026Updated last week
xjdr-alt / simple_transformer
View on GitHub
Simple Transformer in Jax
☆143Jun 22, 2024Updated 2 years ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,389Updated this week
arcee-ai / pybubble
View on GitHub
☆81Feb 18, 2026Updated 5 months ago
KellerJordan / modded-nanogpt
View on GitHub
NanoGPT (124M) in 90 seconds
☆5,518Jul 3, 2026Updated 2 weeks ago
Dao-AILab / quack
View on GitHub
A Quirky Assortment of CuTe Kernels
☆1,063Updated this week
Infatoshi / docs.md
View on GitHub
☆93Dec 16, 2025Updated 7 months ago
sgl-project / mini-sglang
View on GitHub
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
☆4,607May 17, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
drbh / yamoe
View on GitHub
🔀 yet another mixture of experts
☆23Jun 5, 2026Updated last month
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
better-call-paul / blackwell_gemm
View on GitHub
☆17Apr 26, 2026Updated 2 months ago
SzymonOzog / Penny
View on GitHub
Hand-Rolled GPU communications library
☆96Nov 25, 2025Updated 7 months ago
secemp9 / rubrics
View on GitHub
a bunch of rubrics I made in different format and structure for llm judge and other use cases
☆16Sep 22, 2025Updated 9 months ago
ChinmayK0607 / heiretsu
View on GitHub
Educational WIP
☆73Feb 16, 2026Updated 5 months ago
furlat / Abstractions
View on GitHub
A Collection of Pydantic Models to Abstract IRL
☆41Dec 10, 2025Updated 7 months ago
dataflowr / gpu_llm_flash-attention
View on GitHub
Course on Flash-attention in Triton
☆100Feb 9, 2026Updated 5 months ago
fal-ai / diffusion-speedrun
View on GitHub
Focused on fast experimentation and simplicity
☆77Dec 24, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
VatsaDev / NanoPoor
View on GitHub
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Apr 22, 2025Updated last year
meta-pytorch / torchforge
View on GitHub
PyTorch-native post-training at scale
☆696Updated this week
marin-community / marin
View on GitHub
Open-source framework for the research and development of foundation models.
☆1,212Updated this week
tokenbender / avataRL
View on GitHub
rl from zero pretrain, can it be done? yes.
☆295Sep 28, 2025Updated 9 months ago
doomslide / attention-graph
View on GitHub
A graph visualization of attention
☆56May 20, 2025Updated last year
pytorch / torchtitan
View on GitHub
A PyTorch native platform for training generative AI models
☆5,545Updated this week
HomebrewML / HeavyBall
View on GitHub
Efficient optimizers
☆335Jul 11, 2026Updated last week
microsoft / ArchScale
View on GitHub
Simple & Scalable Pretraining for Neural Architecture Research
☆336Mar 31, 2026Updated 3 months ago
NVIDIA-NeMo / Emerging-Optimizers
View on GitHub
☆209Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
HazyResearch / ThunderKittens
View on GitHub
Tile primitives for speedy kernels
☆3,552Jul 13, 2026Updated last week
Aleph-Alpha / Alpha-MoE
View on GitHub
☆66Dec 10, 2025Updated 7 months ago
arcee-ai / trinity-large-tech-report
View on GitHub
☆126Feb 19, 2026Updated 5 months ago
hallerite / ludic
View on GitHub
Ludic – an LLM-RL library for the era of experience
☆67Jan 9, 2026Updated 6 months ago
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆7,551Updated this week
huggingface / picotron
View on GitHub
Minimalistic 4D-parallelism distributed training framework for education purpose
☆2,254Aug 26, 2025Updated 10 months ago
TextArena / UnstableBaselines
View on GitHub
☆120Apr 7, 2026Updated 3 months ago