SJTU-DENG-Lab/AdaMoE

[Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models

☆20

Alternatives and similar repositories for AdaMoE

Users who are interested in AdaMoE are comparing it to the repositories listed below.

SJTU-DENG-Lab / Orthogonal-Neural-operator
Code for orthogonal neural operator
☆17 · Oct 15, 2023 · Updated 2 years ago
SJTU-DENG-Lab / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆57 · Mar 6, 2025 · Updated last year
SJTU-DENG-Lab / LoPA
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding
☆39 · Apr 25, 2026 · Updated 2 months ago
Graph-COM / StruRW
[ICML 2023] Structural Re-weighting Improves Graph Domain Adaptation (StruRW)
☆22 · Jun 20, 2023 · Updated 3 years ago
hao-ai-lab / JacobiForcing
[ICML 2026] Jacobi Forcing: Fast and Accurate Diffusion-style Decoding
☆122 · Feb 20, 2026 · Updated 5 months ago
SJTU-DENG-Lab / Mantis
[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆92 · Jun 5, 2026 · Updated last month
thunlp / SparsingLaw
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆32 · Nov 12, 2024 · Updated last year
SJTU-DENG-Lab / LatentUM
☆56 · Apr 9, 2026 · Updated 3 months ago
THUNLP-MT / CODIS
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
☆13 · Oct 14, 2024 · Updated last year
qiuzh20 / EMoE
Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]
☆39 · May 28, 2024 · Updated 2 years ago
yikangshen / megablocks
☆20 · May 30, 2024 · Updated 2 years ago
HelmholtzAI-FZJ / flex_gen
☆20 · Jan 10, 2025 · Updated last year
razonyang / caddy-dnspodcn
Caddy2 DNSPod.cn DNS Provider module
☆11 · May 9, 2025 · Updated last year
junwu6 / GRADE
Non-IID Transfer Learning on Graphs
☆13 · Jul 4, 2023 · Updated 3 years ago
tomsal / endtoenddecisiontrees
Implementation of the paper End-to-end Learning of Deterministic Decision Trees
☆17 · May 19, 2022 · Updated 4 years ago
Graph-COM / Pair-Align
[ICML 2024] Code for Pairwise Alignment Improves Graph Domain Adaptation (Pair-Align)
☆14 · Jun 15, 2024 · Updated 2 years ago
black4321 / InterBERT
The official implementation of InterBERT
☆11 · Oct 18, 2022 · Updated 3 years ago
leibniz-future-lab / SelfDistill-SER
☆18 · Apr 28, 2023 · Updated 3 years ago
Barcavin / efficient-node-labelling
Code for NeurIPS 2024 paper: "Pure Message Passing Can Estimate Common Neighbor for Link Prediction"
☆17 · Oct 8, 2024 · Updated last year
SJTU-DENG-Lab / Discrete-Diffusion-Forcing
Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference
☆261 · Feb 3, 2026 · Updated 5 months ago
Shen-Lab / GDA-SpecReg
[ICLR 2023] "Graph Domain Adaptation via Theory-Grounded Spectral Regularization" by Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen
☆25 · Feb 27, 2023 · Updated 3 years ago
FAU-LMS / NCN_for_M2M
Neural image compression models optimized for Mask R-CNN from paper "Boosting Neural Image Compression for Machines Using Latent Space Ma…
☆10 · Aug 16, 2022 · Updated 3 years ago
LARS-research / TREFE
Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022
☆13 · Nov 25, 2022 · Updated 3 years ago
hanjiale / GenPT
[Findings of EMNLP 2022] Code of paper Generative Prompt Tuning for Relation Classification. https://arxiv.org/abs/2210.12435
☆20 · May 7, 2023 · Updated 3 years ago
gy-7 / mmyolo
YOLOv9 implementation with mmyolo
☆12 · Jul 8, 2024 · Updated 2 years ago
weitongseu / PCL
☆10 · Jul 11, 2022 · Updated 4 years ago
hao-ai-lab / flash-attention-fp4
NVFP4 Flash-Attention 4 on Blackwell
☆28 · Updated this week
MIV-XJTU / EvoPrompt
PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).
☆13 · Apr 15, 2024 · Updated 2 years ago
wangkai930418 / HCV_IIRC
Code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15 · Oct 28, 2022 · Updated 3 years ago
SJTU-DENG-Lab / LOVECon
Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"
☆43 · Oct 26, 2023 · Updated 2 years ago
BUPT-GAMMA / CaGCN
This repo is for source code of NeurIPS 2021 paper "Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration".
☆23 · Jan 4, 2022 · Updated 4 years ago
zhaozh10 / DenseCLIP
☆10 · Aug 31, 2023 · Updated 2 years ago
Yaoming95 / CIAT
code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation
☆18 · Oct 19, 2022 · Updated 3 years ago
dourgey / qwen2_moe_mergekit
A tool for generating Qwen2 MoE models from Qwen2 (Qwen1.5) models
☆15 · Mar 29, 2024 · Updated 2 years ago
leezythu / FocusLLM
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45 · Dec 8, 2024 · Updated last year
ZoomLabCMU / puzzlebot
☆11 · May 22, 2023 · Updated 3 years ago
JamesYang568 / Attention-guided-Feature-Fusion-for-Small-Object-Detection
Implementation of 'Attention-guided Feature Fusion for Small Object Detection'
☆14 · Dec 21, 2023 · Updated 2 years ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆36 · Mar 28, 2025 · Updated last year
WNJXYK / DeCoOp
☆16 · Jun 4, 2024 · Updated 2 years ago