Shwai-He / PAD-Net
Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆9 · Updated last year
Alternatives and similar repositories for PAD-Net:
Users who are interested in PAD-Net are comparing it to the repositories listed below.
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)" ☆37 · Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆18 · Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation ☆23 · Updated last week
- Model merging is a highly efficient approach for long-to-short reasoning. ☆43 · Updated last month
- ☆13 · Updated 3 weeks ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA… ☆29 · Updated 5 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long) ☆59 · Updated 7 months ago
- BeHonest: Benchmarking Honesty in Large Language Models ☆31 · Updated 8 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs ☆71 · Updated 2 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆109 · Updated last year
- ☆29 · Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" ☆61 · Updated last year
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆47 · Updated 2 months ago
- An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length… ☆10 · Updated this week
- ☆17 · Updated last year
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024) ☆32 · Updated 10 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024] ☆29 · Updated 11 months ago
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆20 · Updated last month
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆60 · Updated 10 months ago
- ☆130 · Updated 9 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆55 · Updated last year
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation ☆20 · Updated 2 months ago
- ☆17 · Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆25 · Updated 5 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆136 · Updated last month
- ☆14 · Updated 6 months ago
- ☆18 · Updated 5 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ☆51 · Updated 2 years ago
- ☆22 · Updated last year
- Restore safety in fine-tuned language models through task arithmetic ☆28 · Updated last year