Shwai-He/PAD-Net

Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".

☆14

Alternatives and similar repositories for PAD-Net

Users who are interested in PAD-Net are comparing it to the repositories listed below.

Shwai-He / SparseAdapter
Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"
☆23 · Feb 28, 2026 · Updated 4 months ago
Shwai-He / MEO
The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":
☆47 · Feb 28, 2026 · Updated 4 months ago
Shwai-He / VLM-Compression
The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".
☆17 · Jul 2, 2024 · Updated 2 years ago
Shwai-He / SparseUnifiedModel
The official implementation of the paper "Understanding and Harnessing Sparsity in Unified Multimodal Models".
☆23 · Apr 25, 2026 · Updated 3 months ago
CASE-Lab-UMD / Capacity-Aware-MoE
The official implementation of the paper "Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts" (ICLR 2026).
☆20 · May 31, 2026 · Updated last month
CASE-Lab-UMD / Router-Tuning-Mixture-of-Depths
The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…
☆31 · Jul 20, 2026 · Updated last week
CASE-Lab-UMD / Unified-MoE-Compression
The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".
☆89 · Feb 28, 2026 · Updated 4 months ago
CASE-Lab-UMD / LLM-Drop
The official implementation of the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)".
☆191 · Apr 23, 2026 · Updated 3 months ago
Yibin-Lei / CSQE
Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"
☆13 · Mar 19, 2024 · Updated 2 years ago
Timothyxxx / NeuralSymbolicPapers
☆14 · Aug 18, 2022 · Updated 3 years ago
Hai-chao-Zhang / OOSTraj
[CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
☆16 · Apr 4, 2024 · Updated 2 years ago
Hai-chao-Zhang / VQToken
[NeurIPS 2025] Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models
☆17 · Nov 10, 2025 · Updated 8 months ago
ImKeTT / AutoRec-Pytorch
[Tool] AutoRec (2015) PyTorch Implementation
☆10 · Mar 1, 2020 · Updated 6 years ago
alphadl / R1
🚀enhanced GRPO with more verifiable rewards and real-time evaluators
☆37 · Jan 27, 2026 · Updated 6 months ago
DaizeDong / Stanford-CS231n-2021-and-2022
Notes and slides for Stanford CS231n 2021 & 2022 in English. I merged the contents together to get a better version. Assignments are not …
☆27 · Updated this week
RZFan525 / Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆84 · Apr 10, 2023 · Updated 3 years ago
DaizeDong / Easier-PS-and-SoP
A LaTeX framework to handle Personal Statement (PS) and Statement of Purpose (SoP) for multiple university applications.
☆25 · Updated this week
David-Li0406 / Contextulization-Distillation
Code and data for EACL 2024 paper "Contextualization Distillation from Large Language Models for Knowledge Graph Completion"
☆25 · Oct 17, 2024 · Updated last year
alphadl / SafeLLM_with_IntentionAnalysis
Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting
☆21 · Mar 25, 2024 · Updated 2 years ago
uw-x / underwatergps
☆20 · Nov 18, 2023 · Updated 2 years ago
cambridge-mlg / SPVAE
Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19)
☆12 · Jun 7, 2020 · Updated 6 years ago
A4Bio / GraphsGPT
The official implementation of the ICML'24 paper "A Graph is Worth K Words: Euclideanizing Graph using Pure Transformer".
☆49 · Mar 19, 2025 · Updated last year
RZFan525 / NLP-PhD-Application-In-The-World
The information of NLP PhD application in the world.
☆37 · Aug 27, 2024 · Updated last year
dame-cell / Triformer
Transformers components but in Triton
☆34 · May 9, 2025 · Updated last year
Hai-chao-Zhang / ThinkJEPA
ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model
☆46 · Jul 19, 2026 · Updated last week
UNITES-Lab / MoE-RBench
[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"
☆11 · Jul 1, 2024 · Updated 2 years ago
nowazrabbani / pMoE_CNN
The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…
☆14 · Feb 12, 2026 · Updated 5 months ago
Yibin-Lei / MetaEOL
Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"
☆12 · Jul 25, 2024 · Updated 2 years ago
wyzjack / CNTP
[ACL 2025] Cautious Next Token Prediction
☆16 · Jul 24, 2025 · Updated last year
RamyaLab / pluralistic-alignment
The open-source repository for PAL: Sample-Efficient Personalized Reward Modeling for Pluralistic Alignment, which provides a general per…
☆17 · Aug 28, 2025 · Updated 10 months ago
Yangyi-Chen / MAYA
Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".
☆13 · Apr 18, 2022 · Updated 4 years ago
XinyuSun / awesome-self-supervised-representation-learning
awesome video representation learning
☆15 · Mar 22, 2021 · Updated 5 years ago
ICTMCG / SDTM
Official repository for "Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration", which has been …
☆17 · Sep 29, 2025 · Updated 9 months ago
back18 / QuanLib.Minecraft
☆51 · May 5, 2026 · Updated 2 months ago
ziyaow1010 / FedHyper
Pytorch Code for FedHyper
☆11 · Aug 28, 2024 · Updated last year
LLM360 / k2-data-prep
☆21 · Jun 4, 2024 · Updated 2 years ago
fabrahman / char-centric-story
Codebase for character-centric story understanding
☆14 · Jan 20, 2022 · Updated 4 years ago
cunliangkong / linux-envs
personal settings for linux tools, including zsh, vim, tmux, pip.
☆11 · Dec 2, 2019 · Updated 6 years ago
tml-epfl / long-is-more-for-alignment
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆21 · May 2, 2024 · Updated 2 years ago