BaohaoLiao / meftsView external linksLinks
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
☆33Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for mefts
Users that are interested in mefts are comparing it to the libraries listed below
Sorting:
- The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding…☆20Jan 18, 2025Updated last year
- ☆11Oct 12, 2023Updated 2 years ago
- [ECMLPKDD 2020] "Topological Insights into Sparse Neural Networks"☆13May 2, 2022Updated 3 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Dec 4, 2024Updated last year
- Source code for ACL 2020 paper "A Span-based Linearization for Constituent Trees"☆13Jan 12, 2022Updated 4 years ago
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- ☆18Mar 10, 2023Updated 2 years ago
- PyTorch implementation of Language model compression with weighted low-rank factorization☆13Jun 28, 2023Updated 2 years ago
- Code and data for paper "(How) do Language Models Track State?"☆21Mar 31, 2025Updated 10 months ago
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 3 years ago
- Crawl & visualize ICLR papers and reviews.☆18Nov 5, 2022Updated 3 years ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆69Oct 15, 2024Updated last year
- ☆23Jan 27, 2025Updated last year
- ☆18Nov 6, 2019Updated 6 years ago
- Parameter Efficient Transfer Learning with Diff Pruning☆75Feb 3, 2021Updated 5 years ago
- ☆52Jan 19, 2023Updated 3 years ago
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆19Apr 14, 2024Updated last year
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- [CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection☆18Oct 3, 2024Updated last year
- ☆20Dec 16, 2020Updated 5 years ago
- ☆58Jul 9, 2024Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Sep 18, 2023Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 8 months ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆285Jun 27, 2023Updated 2 years ago
- Code for paper "Dependency-based Mixture Language Models" by Zhixian Yang, and Xiaojun Wan. This paper is accepted by ACL 2022 Main Confe…☆26May 27, 2022Updated 3 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated last year
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆57May 10, 2023Updated 2 years ago
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Feb 28, 2025Updated 11 months ago
- Libraries for efficient and scalable group-structured dataset pipelines.☆25Jun 18, 2025Updated 7 months ago
- ☆34Aug 23, 2023Updated 2 years ago