cmu-flame/FLAME-MoE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cmu-flame/FLAME-MoE)

cmu-flame / FLAME-MoE

Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models

☆42

Alternatives and similar repositories for FLAME-MoE

Users that are interested in FLAME-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fedebotu / NeurIPS2022-OpenReviewData
View on GitHub
Crawl & Visualize NeurIPS 2022 Data from OpenReview
☆14Nov 8, 2022Updated 3 years ago
commoncrawl / ia-web-commons
View on GitHub
Web archiving utility library
☆11Jul 21, 2026Updated last week
xinhaoc / ferret
View on GitHub
Autonomous CUDA kernel optimization agent with structured task specs and per-config scoring
☆17Jun 17, 2026Updated last month
gccnlp / Light-PEFT
View on GitHub
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
mlc-ai / pith-train
View on GitHub
Compact and Agent-Native MoE Training System
☆304Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
LaVi-Lab / FTTT
View on GitHub
[ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.
☆13May 16, 2025Updated last year
srihari-humbarwadi / TensorRT-for-keras
View on GitHub
Optimizing keras models using Nvidia TensorRT
☆13Aug 8, 2019Updated 6 years ago
OpenMatch / MARVEL
View on GitHub
[ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…
☆39Jun 30, 2024Updated 2 years ago
kssteven418 / SqueezeLLM-gradients
View on GitHub
☆21Feb 5, 2024Updated 2 years ago
juntang-zhuang / ACProp-Optimizer
View on GitHub
Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)
☆17Oct 11, 2021Updated 4 years ago
Infini-AI-Lab / S2FT
View on GitHub
☆19Jan 3, 2025Updated last year
oscardhc / Compiler-Spider
View on GitHub
http://spider.oscardhc.com
☆13Jul 9, 2020Updated 6 years ago
wu-qing-157 / RISCV-CPU
View on GitHub
A Homework for Computer Architecture at SJTU
☆14Jan 4, 2020Updated 6 years ago
liaoq / pnas2019
View on GitHub
☆11Nov 27, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ZhenweiAn / Dynamic_MoE
View on GitHub
Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"
☆74Jul 30, 2024Updated last year
tlc-pack / TLCBench
View on GitHub
Benchmark scripts for TVM
☆75Mar 15, 2022Updated 4 years ago
alan-turing-institute / AnnotateChange
View on GitHub
A simple flask application to collect annotations for the Turing Change Point Dataset, a benchmark dataset for change point detection alg…
☆22Feb 18, 2021Updated 5 years ago
OpenMatch / Augmentation-Adapted-Retriever
View on GitHub
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…
☆60Jul 12, 2024Updated 2 years ago
ModelTC / MoDES
View on GitHub
[CVPR 2026] This is the official PyTorch implementation of "MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via D…
☆31Mar 16, 2026Updated 4 months ago
adebray / equivariant_homotopy_theory
View on GitHub
Notes from Andrew Blumberg's class on equivariant homotopy theory
☆17Aug 19, 2025Updated 11 months ago
LiJiasen-00921 / Arch2019_Assignment
View on GitHub
☆10Mar 18, 2020Updated 6 years ago
tgale96 / grouped_gemm
View on GitHub
PyTorch bindings for CUTLASS grouped GEMM.
☆154May 29, 2025Updated last year
bsxfan / PYLLR
View on GitHub
Python toolkit for likelihood-ratio calibration of binary classifiers
☆25Feb 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
PaulAlbert31 / RandLoRA
View on GitHub
☆31Jun 6, 2025Updated last year
lukeluocn / multicoresysu2020
View on GitHub
☆11Aug 4, 2020Updated 5 years ago
HectorHHZ / Sparse_Matrix_Tuning
View on GitHub
Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices
☆26Feb 2, 2026Updated 5 months ago
apache / tvm-ffi
View on GitHub
Open ABI and FFI for Machine Learning Systems
☆437Updated this week
norakassner / LAMA_primed_negated
View on GitHub
☆14Sep 17, 2020Updated 5 years ago
matttreed / diloco-sim
View on GitHub
☆23Jan 5, 2025Updated last year
xichenye0930 / Active-Negative-Loss
View on GitHub
☆14Dec 21, 2024Updated last year
TsinghuaC3I / ZEDA
View on GitHub
Post-Trained MoE Can Skip Half Experts via Self-Distillation
☆38May 19, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DBGroup-SUSTech / GPU-Merkle-Patricia-Trie
View on GitHub
An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…
☆14Apr 15, 2024Updated 2 years ago
Kennethborup / centered_kernel_alignment
View on GitHub
Implementation of Centered Kernel Alignment (CKA)
☆10Apr 7, 2021Updated 5 years ago
sythello / ChartDialog
View on GitHub
A dataset for training interactive plotting agent
☆14Dec 8, 2022Updated 3 years ago
ImKeTT / ZeroGen
View on GitHub
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14Oct 7, 2023Updated 2 years ago
NVIDIA / jax-tvm-ffi
View on GitHub
JAX support for tvm-ffi abi
☆26May 14, 2026Updated 2 months ago
cms-opendata-analyses / DimuonSpectrumNanoAODOutreachAnalysis
View on GitHub
Analysis using reduced NanoAOD files created from CMS open data producing a high statistics di-muon spectrum
☆15Sep 5, 2023Updated 2 years ago
yangarbiter / rare-spurious-correlation
View on GitHub
Understanding Rare Spurious Correlations in Neural Network
☆12Jun 5, 2022Updated 4 years ago