ChenZiHong-Gavin/MoE-Visualizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ChenZiHong-Gavin/MoE-Visualizer)

ChenZiHong-Gavin / MoE-Visualizer

MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.

☆16

Alternatives and similar repositories for MoE-Visualizer

Users that are interested in MoE-Visualizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ziyaow1010 / FedHyper
View on GitHub
Pytorch Code for FedHyper
☆11Aug 28, 2024Updated last year
SeedScientist / OmicsBench
View on GitHub
[WAICA 2026] Beyond Accuracy: Can LLMs Provide Biologically Grounded Evidence in Sequence-Based Omics Tasks?
☆21May 14, 2026Updated 2 months ago
ChenZiHong-Gavin / LLM-Everything
View on GitHub
以帮助你快速找到 LLM 相关工作，尽快抓住 AI 红利为目标的【LLM 教程】
☆167Updated this week
wangyccn / CR-AI-V1.5
View on GitHub
CRAI is a multimodal large language model based on the Mixture of Experts (MoE) architecture, supporting text and image cross-modal tasks…
☆16Apr 29, 2025Updated last year
InternScience / SeedBench
View on GitHub
[ACL 2025] SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science🌾
☆23Dec 19, 2025Updated 7 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
guanchuwang / Taylor-Unswift
View on GitHub
☆22Oct 3, 2024Updated last year
wrmedford / moe-scaling
View on GitHub
Scaling Laws for Mixture of Experts Models
☆15Feb 25, 2025Updated last year
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
zdebruine / MMVAE
View on GitHub
Mixture-of-Experts Multimodal Variational Autoencoder
☆15Jul 3, 2025Updated last year
TornadoInsight / HealthPlus
View on GitHub
This System for a Health Plus. The system includes Registration of patients, Making appointments, Storing patient records, Billing in the…
☆56Nov 20, 2025Updated 8 months ago
Yibin-Lei / CSQE
View on GitHub
Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"
☆13Mar 19, 2024Updated 2 years ago
he-h / ST-MoE-BERT
View on GitHub
This repository contains the code for the paper "ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mo…
☆16Feb 20, 2025Updated last year
Yibin-Lei / MetaEOL
View on GitHub
Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"
☆12Jul 25, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Taishi-N324 / Drop-Upcycling
View on GitHub
[ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization
☆25Oct 5, 2025Updated 9 months ago
ziyaow1010 / vla-datasets-benchmarks
View on GitHub
A curated list of datasets and benchmarks for Vision-Language-Action (VLA) research, with a focus on evaluation protocols and practical g…
☆31Apr 28, 2026Updated 2 months ago
ttw1018 / MoPE-DST
View on GitHub
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19Jan 25, 2025Updated last year
kamanphoebe / Look-into-MoEs
View on GitHub
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆61Feb 7, 2025Updated last year
JarvisPei / CMoE
View on GitHub
[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis
☆46Jun 30, 2026Updated 3 weeks ago
The-Swarm-Corporation / Mamba-R1
View on GitHub
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25Oct 13, 2025Updated 9 months ago
vivekghanchi / ARGeeks
View on GitHub
This is an Augmented Reality application which will help in learning about Wild life animal by creating an augmented Zoo and Spread awar…
☆10Nov 1, 2018Updated 7 years ago
Kangningthu / SUM
View on GitHub
Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).
☆16Jan 9, 2025Updated last year
CASE-Lab-UMD / Router-Tuning-Mixture-of-Depths
View on GitHub
The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…
☆31Updated this week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
stellarloop / Health-Care-Facility
View on GitHub
A Management System for a Health Care Facility. The system includes Registration of patients, Making appointments, Storing patient record…
☆88Nov 20, 2025Updated 8 months ago
ImKeTT / AutoRec-Pytorch
View on GitHub
[Tool] AutoRec (2015) PyTorch Implementation
☆10Mar 1, 2020Updated 6 years ago
zwxandy / Awesome-Efficient-CoT-Reasoning-Summary
View on GitHub
🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…
☆65May 22, 2025Updated last year
kyegomez / MHMoE
View on GitHub
Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch
☆30Updated this week
casszhao / PruneHall
View on GitHub
Codebase, data and models for hallucination of pruned models
☆16Jan 11, 2025Updated last year
xiaoqqchen / Here3DSpider
View on GitHub
抓取Here地图的三维建筑物模型
☆12Jun 29, 2017Updated 9 years ago
scaleapi / propensity-evaluation
View on GitHub
open Source code for propensity evaluation
☆19Apr 25, 2026Updated 3 months ago
LiYuhangUSTC / Lines2Face
View on GitHub
☆10Aug 28, 2020Updated 5 years ago
Hai-chao-Zhang / VQToken
View on GitHub
[NeurIPS 2025] Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models
☆17Nov 10, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kyegomez / LIMoE
View on GitHub
Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…
☆35Updated this week
Shwai-He / VLM-Compression
View on GitHub
The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".
☆17Jul 2, 2024Updated 2 years ago
ylsung / ECoFLaP
View on GitHub
Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)
☆21Feb 16, 2024Updated 2 years ago
PKU-RL / ResDex
View on GitHub
Official code for "Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping" (ICLR 2025)
☆30Oct 25, 2025Updated 9 months ago
JieShibo / MoLE
View on GitHub
[ICML 2025 Oral] Mixture of Lookup Experts
☆78Dec 3, 2025Updated 7 months ago
Ther-nullptr / circult-eda-mlsys-tinyml-arxiv-daily
View on GitHub
🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)
☆10Updated this week
idealclover / NJU-Thesis-Word
View on GitHub
南京大学本科毕业论文 Word 模板
☆12May 14, 2020Updated 6 years ago