ChenZiHong-Gavin / MoE-Visualizer
MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.
☆16Apr 8, 2025Updated 10 months ago
Alternatives and similar repositories for MoE-Visualizer
Users who are interested in MoE-Visualizer are comparing it to the libraries listed below:
- PyTorch code for FedHyper☆11Aug 28, 2024Updated last year
- Code release for AdapMoE accepted by ICCAD 2024☆35Apr 28, 2025Updated 9 months ago
- ☆22Oct 3, 2024Updated last year
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆34Mar 6, 2025Updated 11 months ago
- CRAI is a multimodal large language model based on the Mixture of Experts (MoE) architecture, supporting text and image cross-modal tasks…☆16Apr 29, 2025Updated 9 months ago
- Data Structure & Algorithm (UCB, spring 2018)☆21Apr 27, 2018Updated 7 years ago
- The code of "M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis"☆14Mar 31, 2025Updated 10 months ago
- ☆16Feb 23, 2025Updated 11 months ago
- Chameleon: A MatMul-Free TCN Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data☆25Jun 6, 2025Updated 8 months ago
- Scrapes 3D building models from HERE Maps☆12Jun 29, 2017Updated 8 years ago
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX.☆17Dec 19, 2022Updated 3 years ago
- ☆12Nov 29, 2020Updated 5 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- KAF: Kolmogorov-Arnold Fourier Networks☆20Feb 19, 2025Updated 11 months ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- ☆18Oct 14, 2025Updated 3 months ago
- This is an augmented reality application that helps users learn about wildlife by creating an augmented zoo and spread awar…☆10Nov 1, 2018Updated 7 years ago
- [ICML 2025 Oral] Mixture of Lookup Experts☆71Dec 3, 2025Updated 2 months ago
- [ECCV 2024] CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs☆18Jul 2, 2024Updated last year
- A whisper repo for TPU☆10Jun 4, 2024Updated last year
- Builds a 3D social network for Fudan University using WebGL and Node.js. Currently implements a campus model demo display and multi-user online chat.☆12Jun 12, 2015Updated 10 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆14Feb 4, 2025Updated last year
- 🎓 Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10Updated this week
- ☆11Mar 27, 2023Updated 2 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- A small demo of three & cannon. Users can roll the dice by clicking on them.☆12Oct 12, 2024Updated last year
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆14Jul 2, 2024Updated last year
- Southeast University (SEU) computer science, software, and AI (mainly the School of Artificial Intelligence): selected course projects and experiments. Includes: Computer Graphics - Knowledge Engineering - Network a…☆16Jul 3, 2024Updated last year
- Problem B: 3D Placement with D2D Vertical Connections☆11Jun 30, 2022Updated 3 years ago
- [CVPR 2025] Efficient Personalization of Quantized Diffusion Model without Backpropagation☆15Mar 31, 2025Updated 10 months ago
- ICCAD-2021-B☆12Aug 5, 2021Updated 4 years ago
- Mixture-of-Experts Multimodal Variational Autoencoder☆15Jul 3, 2025Updated 7 months ago
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆13Mar 17, 2025Updated 10 months ago
- An autoencoder-based system that predicts the recipe of a food from its images☆10Jul 25, 2020Updated 5 years ago
- Used an FPGA board and SystemVerilog to design a controller, DMA, pipelined SIMD processor, and GEMM accelerator☆12Aug 26, 2023Updated 2 years ago
- Scaling Laws for Mixture of Experts Models☆15Feb 25, 2025Updated 11 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆60Feb 7, 2025Updated last year
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆21Apr 16, 2025Updated 9 months ago
- Python implementation of Medoidshift and Quickshift algorithms☆15Feb 5, 2015Updated 11 years ago