Shwai-He / PAD-NetView external linksLinks
Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".
☆11Apr 7, 2024Updated last year
Alternatives and similar repositories for PAD-Net
Users that are interested in PAD-Net are comparing it to the libraries listed below
Sorting:
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"☆21Apr 7, 2024Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Apr 7, 2024Updated last year
- The official implementation of the paper "Rethinking Pruning for Vision-Language Models: Strategies for Effective Sparsity".☆14Jul 2, 2024Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…☆28Oct 1, 2025Updated 4 months ago
- The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".☆88Mar 19, 2025Updated 10 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆188Nov 14, 2025Updated 3 months ago
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 5 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- [CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising☆16Apr 4, 2024Updated last year
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆82Apr 10, 2023Updated 2 years ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated last year
- Code and data for EACL 2024 paper "Contextualization Distillation from Large Language Models for Knowledge Graph Completion"☆24Oct 17, 2024Updated last year
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- The information of NLP PhD application in the world.☆37Aug 27, 2024Updated last year
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- Pytorch version of Continuous Language Generative Flow (ACL 2021)☆11Sep 14, 2021Updated 4 years ago
- personal settings for linux tools, including zsh, vim, tmux, pip.☆11Dec 2, 2019Updated 6 years ago
- ☆10May 16, 2024Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- 计算TFIDF的三种方法:Python、sklearn、gensim☆11Feb 26, 2019Updated 6 years ago
- Code to reproduce the paper "Do causal predictors generalize better to new domains?"☆15Feb 7, 2025Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated 10 months ago
- [NeurIPS 2023] Latent Graph Inference with Limited Supervision☆16Feb 1, 2024Updated 2 years ago
- The Python solutions of leetcode☆13Apr 26, 2020Updated 5 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 11 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- ☆13Jun 25, 2025Updated 7 months ago
- The official implementation of the ICML'24 paper "A Graph is Worth K Words: Euclideanizing Graph using Pure Transformer".☆47Mar 19, 2025Updated 10 months ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Mar 28, 2023Updated 2 years ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- ☆11May 1, 2022Updated 3 years ago
- [WNGT(2019)] On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation☆11Apr 27, 2022Updated 3 years ago
- Explicit Sentence Compression for Neural Machine Translation☆10May 12, 2020Updated 5 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆10Nov 2, 2015Updated 10 years ago