RUCAIBox/MPOE

☆19

Alternatives and similar repositories for MPOE

Users who are interested in MPOE are comparing it to the libraries listed below.

microsoft / Stochastic-Mixture-of-Experts
This package implements THOR: Transformer with Stochastic Experts.
☆64 · Oct 7, 2021 · Updated 4 years ago
nowazrabbani / pMoE_CNN
The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…
☆14 · Feb 12, 2026 · Updated 5 months ago
codecaution / EvoMoE
☆21 · Oct 31, 2022 · Updated 3 years ago
OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17 · Mar 4, 2024 · Updated 2 years ago
livingoptics / spatial-spectral-ml
Spatial Spectral Machine Learning
☆14 · Oct 15, 2025 · Updated 9 months ago
thu-pacman / FasterMoE
☆92 · Apr 2, 2022 · Updated 4 years ago
Xilinx / SDFEC-PYNQ
A PYNQ overlay demonstrating the Xilinx RFSoC SD-FEC
☆13 · Jun 29, 2022 · Updated 4 years ago
delcypher / nsolv
Nsolv - A front-end that allows multiple SMTLIBv2 compliant solvers to be executed in parallel.
☆11 · Dec 7, 2012 · Updated 13 years ago
FPGA-Research / FPGAVirusScanner
Program to scan for malicious FPGA designs.
☆17 · Mar 20, 2021 · Updated 5 years ago
ryao / llama3.c
A fork of llama3.c used to do some R&D on inferencing
☆23 · Dec 20, 2024 · Updated last year
Shwai-He / PAD-Net
Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆14 · Feb 28, 2026 · Updated 5 months ago
ZiCog / xoroshiro
The xoroshiro32++ and xoroshiro64++ PRNG algorithms by David Blackman and Sebastiano Vigna in C++, Verilog, VHDL and SpinalHDL.
☆16 · Dec 2, 2018 · Updated 7 years ago
FPGA-Research / zynq-ultrascale-readback-capture
This document adopts the method from the XAPP1230 for doing readback capture on Xilinx UltraScale devices and shows how to migrate the sa…
☆18 · Nov 15, 2019 · Updated 6 years ago
Shwai-He / MEO
The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)".
☆47 · Feb 28, 2026 · Updated 5 months ago
UNITES-Lab / MC-SMoE
[ICLR'24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"
☆108 · Jun 20, 2025 · Updated last year
Hunter-DDM / stablemoe
Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"
☆52 · Jul 17, 2022 · Updated 4 years ago
enyac-group / UniQL
UniQL official repository (ICLR 2026)
☆17 · Jan 27, 2026 · Updated 6 months ago
chrysts / generative_preconditioner
☆11 · Oct 8, 2020 · Updated 5 years ago
kanonjz / paper
Machine Learning System
☆14 · May 11, 2020 · Updated 6 years ago
AlexandraVolokhova / stochasticity_in_neural_ode
"Stochasticity in Neural ODEs: An Empirical Study". Experiments from the paper
☆13 · Apr 27, 2020 · Updated 6 years ago
tianyang-x / Mixture-of-Domain-Adapters
Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…
☆51 · Oct 8, 2023 · Updated 2 years ago
mirjanastojilovic / RDS
FPGA routing delay sensors for effective remote power analysis attacks
☆14 · Aug 13, 2024 · Updated last year
swtheing / LLM-Performance-Improvement-Paper
☆17 · Jul 10, 2023 · Updated 3 years ago
st01tyy / LightScale
Lightweight and Scalable Post-training: The Ray-Free, Debug-Friendly Alignment Stack with Megatron-native simplicity.
☆55 · May 20, 2026 · Updated 2 months ago
Crimsonninja / senior_design_puf
Repository to store all design and testbench files for Senior Design
☆22 · Apr 16, 2020 · Updated 6 years ago
YanglanOu / patcher
☆27 · Aug 23, 2022 · Updated 3 years ago
nicolaihaeni / corn
Official Pytorch implementation of Continuous Object Representation Networks: Novel View Synthesis without 3D or Target View Supervision
☆15 · Nov 8, 2020 · Updated 5 years ago
r-three / smear
☆30 · Sep 28, 2023 · Updated 2 years ago
InterDigitalInc / DialogSummary-VideoQA
☆10 · Mar 30, 2022 · Updated 4 years ago
tldoan / PCA-OGD
Code for PCA-OGD (AISTATS 2021)
☆11 · Mar 16, 2021 · Updated 5 years ago
Ceaglex / LoVA
The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) arc…
☆16 · Feb 27, 2025 · Updated last year
sony / MambaPEFT
☆23 · Mar 27, 2025 · Updated last year
efeslab / fiddler
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
☆267 · Nov 18, 2024 · Updated last year
Ekoda / SoftMoE
Soft Mixture of Experts Vision Transformer, addressing MoE limitations as highlighted by Puigcerver et al., 2023.
☆16 · Aug 13, 2023 · Updated 2 years ago
modelize-ai / LLM-Inference-Deployment-Tutorial
Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…
☆19 · Sep 5, 2023 · Updated 2 years ago
JungHoyoun / PromptCompressor
☆12 · Apr 29, 2024 · Updated 2 years ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆19 · Nov 1, 2023 · Updated 2 years ago
giangdip2410 / HyperRouter
Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33 · Nov 29, 2023 · Updated 2 years ago
Haskely / gsm8k-rft-llama7b-u13b_evaluation
Tests the GSM8K score of https://huggingface.co/OFA-Sys/gsm8k-rft-llama7b-u13b
☆15 · Aug 10, 2023 · Updated 2 years ago