Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆11Feb 28, 2026Updated 3 weeks ago
Alternatives and similar repositories for PAD-Net
Users that are interested in PAD-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"☆21Feb 28, 2026Updated 3 weeks ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Feb 28, 2026Updated 3 weeks ago
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆15Jul 2, 2024Updated last year
- The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".☆89Feb 28, 2026Updated 3 weeks ago
- The official implementation of the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping (TMLR)".☆188Mar 6, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated 2 years ago
- ☆14Aug 18, 2022Updated 3 years ago
- [CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising☆17Apr 4, 2024Updated last year
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 6 years ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆81Apr 10, 2023Updated 2 years ago
- Code and data for EACL 2024 paper "Contextualization Distillation from Large Language Models for Knowledge Graph Completion"☆24Oct 17, 2024Updated last year
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated 2 years ago
- Paper list of Video LLM hallucination. Welcome to Star and Contribute!☆23Mar 6, 2026Updated 3 weeks ago
- [NeurIPS 2023] Latent Graph Inference with Limited Supervision☆27Feb 1, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The official implementation of the ICML'24 paper "A Graph is Worth K Words: Euclideanizing Graph using Pure Transformer".☆48Mar 19, 2025Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 10 months ago
- The information of NLP PhD application in the world.☆37Aug 27, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated 11 months ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated last month
- Pytorch Code for FedHyper☆11Aug 28, 2024Updated last year
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".☆13Apr 18, 2022Updated 3 years ago
- awesome video representation learning☆15Mar 22, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆21Jun 4, 2024Updated last year
- Code to reproduce the paper "Do causal predictors generalize better to new domains?"☆15Feb 7, 2025Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated last year
- ☆12Sep 23, 2024Updated last year
- ☆13Jul 14, 2024Updated last year
- Website for HKU NLP group (under construction)☆14Mar 20, 2026Updated last week
- ☆19Sep 15, 2022Updated 3 years ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- 🚀enhanced GRPO with more verifiable rewards and real-time evaluators☆37Jan 27, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations☆15Jan 6, 2017Updated 9 years ago
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 5 months ago
- Remote Access your GitHub Actions via Browser Based VS Code☆26Jan 12, 2025Updated last year
- ☆20Oct 31, 2022Updated 3 years ago
- ☆19Jan 5, 2023Updated 3 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 3 years ago