Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆12 · Feb 28, 2026 · Updated 2 months ago
Alternatives and similar repositories for PAD-Net
Users that are interested in PAD-Net are comparing it to the libraries listed below.
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆22 · Feb 28, 2026 · Updated 2 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)" ☆46 · Feb 28, 2026 · Updated 2 months ago
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity". ☆16 · Jul 2, 2024 · Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E… ☆31 · Apr 26, 2026 · Updated last week
- The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)". ☆91 · Feb 28, 2026 · Updated 2 months ago
- The official implementation of the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)". ☆190 · Apr 23, 2026 · Updated 2 weeks ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models" ☆12 · Mar 19, 2024 · Updated 2 years ago
- ☆14 · Aug 18, 2022 · Updated 3 years ago
- [CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising ☆17 · Apr 4, 2024 · Updated 2 years ago
- [Tool] AutoRec (2015) PyTorch Implementation ☆10 · Mar 1, 2020 · Updated 6 years ago
- Notes and slides for Stanford CS231n 2021 & 2022 in English. I merged the contents together to get a better version. Assignments are not … ☆27 · Sep 30, 2022 · Updated 3 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆84 · Apr 10, 2023 · Updated 3 years ago
- Code and data for EACL 2024 paper "Contextualization Distillation from Large Language Models for Knowledge Graph Completion" ☆25 · Oct 17, 2024 · Updated last year
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting ☆21 · Mar 25, 2024 · Updated 2 years ago
- ☆20 · Nov 18, 2023 · Updated 2 years ago
- [ACL 2026] Paper list of Video LLM hallucination. Welcome to Star and Contribute! ☆31 · Updated this week
- The official implementation of the ICML'24 paper "A Graph is Worth K Words: Euclideanizing Graph using Pure Transformer". ☆48 · Mar 19, 2025 · Updated last year
- Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19) ☆12 · Jun 7, 2020 · Updated 5 years ago
- [NeurIPS 2023] Latent Graph Inference with Limited Supervision ☆33 · Feb 1, 2024 · Updated 2 years ago
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 11 months ago
- The information of NLP PhD application in the world. ☆37 · Aug 27, 2024 · Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts" ☆10 · Jul 1, 2024 · Updated last year
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models" ☆12 · Jul 25, 2024 · Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models. ☆16 · Apr 8, 2025 · Updated last year
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef… ☆14 · Feb 12, 2026 · Updated 2 months ago
- ☆50 · Updated this week
- Pytorch Code for FedHyper ☆11 · Aug 28, 2024 · Updated last year
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning". ☆13 · Apr 18, 2022 · Updated 4 years ago
- awesome video representation learning ☆15 · Mar 22, 2021 · Updated 5 years ago
- ☆21 · Jun 4, 2024 · Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?" ☆16 · Feb 7, 2025 · Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024] ☆21 · May 2, 2024 · Updated 2 years ago
- Codebase for character-centric story understanding ☆14 · Jan 20, 2022 · Updated 4 years ago
- personal settings for linux tools, including zsh, vim, tmux, pip. ☆11 · Dec 2, 2019 · Updated 6 years ago
- ☆14 · Jun 25, 2025 · Updated 10 months ago
- ☆12 · Sep 23, 2024 · Updated last year
- ☆11 · May 1, 2022 · Updated 4 years ago
- Website for HKU NLP group (under construction) ☆14 · Mar 20, 2026 · Updated last month
- ☆13 · Jul 14, 2024 · Updated last year