tilde-research/MoMoE-impl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tilde-research/MoMoE-impl)

tilde-research / MoMoE-impl

Memory optimized Mixture of Experts

☆75

Alternatives and similar repositories for MoMoE-impl

Users that are interested in MoMoE-impl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lcy-seso / DLFrameworkTest
View on GitHub
My tests and experiments with some popular dl frameworks.
☆17Sep 11, 2025Updated 7 months ago
Infini-AI-Lab / vortex_torch
View on GitHub
Vortex: A Flexible and Efficient Sparse Attention Framework
☆50Apr 6, 2026Updated last week
infinigence / HamiltonAttention
View on GitHub
☆44Oct 15, 2025Updated 5 months ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆19Apr 1, 2026Updated last week
infinigence / FUSCO
View on GitHub
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
☆117Mar 7, 2026Updated last month
DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
allenai / DataDecide
View on GitHub
☆41Aug 20, 2025Updated 7 months ago
duykhuongnguyen / MAT-Steer
View on GitHub
☆18Aug 19, 2025Updated 7 months ago
JoeLi12345 / nGPT
View on GitHub
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆111Mar 7, 2025Updated last year
GbotHQ / Blender-3D-document-rendering-pipeline
View on GitHub
Render documents on a virtual paper with folds and other types of damage using blender geometry nodes.
☆26Aug 14, 2023Updated 2 years ago
damek / gd-lean
View on GitHub
☆36Feb 8, 2026Updated 2 months ago
tile-ai / tilescale
View on GitHub
Tile-based language built for AI computation across all scales
☆141Mar 27, 2026Updated 2 weeks ago
NVIDIA / hoti-2025-gpu-comms-tutorial
View on GitHub
Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025
☆31Oct 22, 2025Updated 5 months ago
KuangjuX / NVSHMEM-Tutorial
View on GitHub
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆174Feb 11, 2026Updated 2 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
RDMA-Rust / sideway
View on GitHub
A better wrapper for using RDMA programming APIs in Rust flavor
☆80Updated this week
zhuozhiyongde / SysY-Compiler-2024Fall-PKU
View on GitHub
北京大学 2024 秋季学期编译原理课程 Lab 代码、笔记、经验
☆18Sep 12, 2025Updated 7 months ago
KempnerInstitute / llm_uncertainty
View on GitHub
Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"
☆11Apr 15, 2024Updated last year
eth-easl / mixtera
View on GitHub
A lightweight, user-friendly data-plane for LLM training.
☆38Sep 10, 2025Updated 7 months ago
openshift-psap / auto-tuning-vllm
View on GitHub
Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
☆51Mar 17, 2026Updated 3 weeks ago
lucidrains / simplicial-attention
View on GitHub
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…
☆47Sep 2, 2025Updated 7 months ago
zhuzilin / flash-attention-with-sink
View on GitHub
☆38Aug 7, 2025Updated 8 months ago
microsoft / ArchScale
View on GitHub
Simple & Scalable Pretraining for Neural Architecture Research
☆327Mar 31, 2026Updated last week
viking-sudo-rm / rusty-dawg
View on GitHub
Rust library for indexing and quickly searching large pretraining corpora
☆31Oct 30, 2025Updated 5 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lemyx / tilelang-dsa
View on GitHub
DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang
☆44Nov 19, 2025Updated 4 months ago
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆36Jun 1, 2025Updated 10 months ago
fla-org / hybrid-distillation
View on GitHub
☆31Dec 31, 2025Updated 3 months ago
Chengsong-Huang / RelayLLM
View on GitHub
☆38Jan 10, 2026Updated 3 months ago
lucasjinreal / Namors
View on GitHub
Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.
☆24Mar 12, 2025Updated last year
outerform / xv6-pku-hints
View on GitHub
hints for xv6lab in installing and doing
☆12Jan 28, 2021Updated 5 years ago
arashkaffamanesh / terraform-aws-eks
View on GitHub
Terraform module to create an Elastic Kubernetes (EKS) cluster and associated worker instances on AWS
☆14Aug 26, 2020Updated 5 years ago
Victarry / PyTorch-Memory-Profiler
View on GitHub
☆46Sep 8, 2025Updated 7 months ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆10Dec 30, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
sar-mo / CS2051-HonorsDiscreteMath
View on GitHub
A collection of resources for CS 2051, an undergraduate Honors Discrete Mathematics course at Georgia Tech.
☆10Jun 24, 2023Updated 2 years ago
seancribbs / essentials-of-compilation
View on GitHub
☆16Nov 2, 2025Updated 5 months ago
Kernel-Machines / kermac
View on GitHub
Pytorch routines for (Ker)nel (Mac)hines
☆11Oct 10, 2025Updated 6 months ago
hugobowne / build-your-own-deep-research-agent
View on GitHub
☆79Mar 28, 2026Updated 2 weeks ago
snowflakedb / ArcticTraining
View on GitHub
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆281Apr 3, 2026Updated last week
plibither8 / otp-forwarder
View on GitHub
💬 Forward OTP SMS's from my phone to my laptop as a desktop notification
☆11Aug 26, 2022Updated 3 years ago
peytontolbert / simple-moe
View on GitHub
Simple MoE - Day 17 of 365 Days of Repos
☆18Jan 17, 2025Updated last year