DavidFanzz/SCMoE

☆29

Alternatives and similar repositories for SCMoE

Users who are interested in SCMoE are comparing it to the repositories listed below.

qiuzh20 / RMoE
Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)
☆33 · Aug 4, 2024 · Updated last year
TianHongZXY / qaap
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12 · Dec 18, 2023 · Updated 2 years ago
VITA-Group / Random-MoE-as-Dropout
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…
☆56 · Feb 28, 2023 · Updated 3 years ago
YJHMITWEB / ExFlow
Explore Inter-layer Expert Affinity in MoE Model Inference
☆16 · May 6, 2024 · Updated 2 years ago
GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30 · Dec 6, 2023 · Updated 2 years ago
DoubtedSteam / RoE
The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"
☆17 · Mar 24, 2025 · Updated last year
UNITES-Lab / MoE-Quantization
Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"
☆30 · Jun 30, 2025 · Updated last year
SihengLi99 / LLM-Honesty-Survey
[2025-TMLR] A Survey on the Honesty of Large Language Models
☆66 · Dec 8, 2024 · Updated last year
mengcaopku / Continual-LLaVA
☆16 · Nov 12, 2024 · Updated last year
Lucky-Lance / Expert_Sparsity
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
☆123 · May 24, 2024 · Updated 2 years ago
likaixin2000 / MMCode
[EMNLP 2024] Multi-modal reasoning problems via code generation.
☆28 · Apr 14, 2026 · Updated 3 months ago
PKU-SEC-Lab / AdapMoE
Code release for AdapMoE accepted by ICCAD 2024
☆39 · Apr 28, 2025 · Updated last year
whn09 / VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
☆11 · Jun 16, 2025 · Updated last year
tianyi-lab / MoE-Embedding
[ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆92 · Oct 15, 2024 · Updated last year
LHRYANG / FSD
Implementation of LREC-COLING 2024 paper A Frustratingly Simple Decoding Method for Neural Text Generation
☆19 · Feb 23, 2024 · Updated 2 years ago
miaoyuchun / InfoRM
The official implementation of InfoRM [NeurIPS 2024].
☆16 · Oct 25, 2025 · Updated 8 months ago
dmis-lab / Monet
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
☆79 · Jun 23, 2025 · Updated last year
yuanxinnn / APTMoE
☆13 · Jun 29, 2024 · Updated 2 years ago
ugonfor / DGQ
[ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models
☆19 · Mar 25, 2025 · Updated last year
floatingsun / transformer_layers_as_painters
transformer layers behavior as painters🧑‍🎨
☆15 · May 6, 2025 · Updated last year
boringlee24 / socc22-miso
MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters
☆21 · Apr 21, 2023 · Updated 3 years ago
FereshteShakeri / few-shot-MedVLMs
☆33 · Oct 6, 2024 · Updated last year
ChartMimic / ChartMimic
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆132 · Dec 19, 2025 · Updated 7 months ago
Jihuai-wpy / InferAligner
Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.
☆38 · Oct 2, 2024 · Updated last year
dextroushands / pretraind_model_for_nlp_tasks
☆14 · Sep 19, 2022 · Updated 3 years ago
ShaojieJiang / tldr
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10 · Aug 11, 2023 · Updated 2 years ago
snu-mllab / Context-Memory
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆64 · Apr 18, 2024 · Updated 2 years ago
Utaotao / ProFit
☆35 · Jan 20, 2026 · Updated 6 months ago
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆59 · May 28, 2025 · Updated last year
JiazhengZhang / AgentV-RL
☆15 · Apr 17, 2026 · Updated 3 months ago
wutaiqiang / awesome-GNN2MLP-distillation
Learning MLPs to replace GNN
☆10 · Jun 3, 2023 · Updated 3 years ago
choidami / inductive-oocr
☆16 · Mar 22, 2025 · Updated last year
zhu-minjun / SafetyLock
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11 · Oct 16, 2024 · Updated last year
navidmdn / ESConv-SRA
Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…
☆15 · Apr 14, 2025 · Updated last year
jiyt17 / ReDiff
Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'
☆45 · Jun 27, 2026 · Updated 3 weeks ago
mlbio-epfl / joint-inference
[ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners
☆22 · Jun 6, 2025 · Updated last year
alexhe101 / FourierISP
Official implementation of AAAI-2024 paper "Enhancing RAW-to-sRGB with Decoupled Style Structure in Fourier Domain"
☆13 · Jun 17, 2024 · Updated 2 years ago
ustc-hyin / HiMAP
Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference
☆14 · Jun 7, 2025 · Updated last year
haizhongzheng / LTE
☆13 · Oct 13, 2025 · Updated 9 months ago