[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"
☆76Feb 4, 2024Updated 2 years ago
Alternatives and similar repositories for PowerfulPromptFT
Users that are interested in PowerfulPromptFT are comparing it to the libraries listed below
Sorting:
- [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"☆102Apr 10, 2024Updated last year
- [AAAI 2022] Dataset and pytorch codes for the paper titled "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" in…☆32Mar 20, 2024Updated last year
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆17Mar 21, 2025Updated 11 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆54Sep 25, 2025Updated 5 months ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Jan 24, 2023Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- ☆13Jul 22, 2023Updated 2 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Pytorch (PyG) and Tensorflow (Keras/Spektral) implementation of Total Variation Graph Neural Network (TVGNN), as presented at ICML 2023.☆20Mar 15, 2025Updated 11 months ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12May 31, 2023Updated 2 years ago
- StyleSwin: Transformer-based GAN for High-resolution Image Generation☆11Dec 21, 2021Updated 4 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆101May 3, 2024Updated last year
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- Replication package for ICSE 2022 submission titled "Automatic Merge Conflict Resolution Tools: The Current State and Barriers to Adoptio…☆12Sep 14, 2021Updated 4 years ago
- ☆18Feb 2, 2026Updated last month
- ☆14May 31, 2022Updated 3 years ago
- Fork of Flame repo for training of some new stuff in development☆19Feb 27, 2026Updated last week
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- Multi-head Recurrent Layer Attention for Vision Network☆22Mar 2, 2023Updated 3 years ago
- Tensorflow 2 implementations of the C-SimCLR and C-BYOL self-supervised visual representation methods from "Compressive Visual Representa…☆37Jan 18, 2022Updated 4 years ago
- The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"☆19Jul 24, 2024Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Jul 10, 2025Updated 7 months ago
- ☆47Nov 8, 2024Updated last year
- The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.☆39Apr 25, 2022Updated 3 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- ☆20Apr 17, 2023Updated 2 years ago
- ☆29Apr 22, 2025Updated 10 months ago
- The official PyTorch implementation for SIGIR 2024 paper "SIGformer: sign-aware graph transformer for recommendation".☆20Sep 4, 2024Updated last year
- ☆28May 27, 2024Updated last year
- This project develops compact transformer models tailored for clinical text analysis, balancing efficiency and performance for healthcare…☆18Mar 26, 2024Updated last year
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Dec 4, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆88Sep 12, 2025Updated 5 months ago
- [EMNLP 2021] Code and data for our paper "Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers…☆20Jan 17, 2022Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆25Jun 22, 2022Updated 3 years ago