Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆12Feb 28, 2026Updated last month
Alternatives and similar repositories for PAD-Net
Users that are interested in PAD-Net are comparing it to the libraries listed below.
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆45Feb 28, 2026Updated last month
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆16Jul 2, 2024Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…☆31Mar 26, 2026Updated 3 weeks ago
- The official implementation of the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)".☆189Mar 6, 2026Updated last month
- ☆14Aug 18, 2022Updated 3 years ago
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 6 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆82Apr 10, 2023Updated 3 years ago
- Code and data for EACL 2024 paper "Contextualization Distillation from Large Language Models for Knowledge Graph Completion"☆24Oct 17, 2024Updated last year
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆21Mar 25, 2024Updated 2 years ago
- The official implementation of the ICML'24 paper "A Graph is Worth K Words: Euclideanizing Graph using Pure Transformer".☆48Mar 19, 2025Updated last year
- [NeurIPS 2023] Latent Graph Inference with Limited Supervision☆31Feb 1, 2024Updated 2 years ago
- The information of NLP PhD application in the world.☆37Aug 27, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- PyTorch code for FedHyper☆11Aug 28, 2024Updated last year
- 🚀 First survey on Attention Sink in Transformers — 180+ papers on utilization, interpretation, and mitigation.☆49Updated this week
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".☆13Apr 18, 2022Updated 3 years ago
- awesome video representation learning☆15Mar 22, 2021Updated 5 years ago
- ☆21Jun 4, 2024Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?"☆15Feb 7, 2025Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated last year
- personal settings for linux tools, including zsh, vim, tmux, pip.☆11Dec 2, 2019Updated 6 years ago
- ☆13Jun 25, 2025Updated 9 months ago
- ☆12Sep 23, 2024Updated last year
- ☆11May 1, 2022Updated 3 years ago
- ☆13Jul 14, 2024Updated last year
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations☆15Jan 6, 2017Updated 9 years ago
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 5 months ago
- ☆19Jan 5, 2023Updated 3 years ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆50Mar 28, 2023Updated 3 years ago
- [ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization☆26Mar 30, 2024Updated 2 years ago
- Codes and data for EMNLP 2021 paper "Self- and Pseudo-self-supervised Prediction of Speaker and Key-utterance for Multi-party Dialogue Re…☆16Oct 15, 2022Updated 3 years ago
- ☆13Sep 28, 2022Updated 3 years ago
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- Estimate MFU for DeepSeekV3☆26Jan 5, 2025Updated last year
- ☆21Feb 10, 2025Updated last year
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆19Apr 5, 2026Updated last week
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆20Feb 16, 2024Updated 2 years ago