ShiZhengyan / PowerfulPromptFTView external linksLinks
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"
☆76Feb 4, 2024Updated 2 years ago
Alternatives and similar repositories for PowerfulPromptFT
Users that are interested in PowerfulPromptFT are comparing it to the libraries listed below
Sorting:
- [NAACL 2022] Dataset and codes for the paper titled "Learning to Execute Actions or Ask Clarification Questions" in Findings of NAACL 202…☆12Jul 25, 2022Updated 3 years ago
- [AAAI 2022] Dataset and pytorch codes for the paper titled "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" in…☆32Mar 20, 2024Updated last year
- [ECIR 2024] Official repository for the paper titled "Self Contrastive Learning for Session-based Recommendation"☆21Apr 3, 2024Updated last year
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38May 24, 2024Updated last year
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆17Mar 21, 2025Updated 10 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆77Jan 27, 2024Updated 2 years ago
- GPT as Knowledger Worker (or if you really want, GPT Sorta' Takes the CPA Exam)☆13Jan 24, 2023Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- ☆13Jul 22, 2023Updated 2 years ago
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- Pytorch (PyG) and Tensorflow (Keras/Spektral) implementation of Total Variation Graph Neural Network (TVGNN), as presented at ICML 2023.☆20Mar 15, 2025Updated 10 months ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆101May 3, 2024Updated last year
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- Code and data for paper "(How) do Language Models Track State?"☆21Mar 31, 2025Updated 10 months ago
- ☆18Feb 2, 2026Updated last week
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development☆19Jan 5, 2026Updated last month
- ☆46Nov 8, 2024Updated last year
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- Mixtral finetuning☆19Feb 2, 2024Updated 2 years ago
- The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.☆40Apr 25, 2022Updated 3 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Dec 4, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- An alternative UI for playing with ChatGPT☆21Mar 14, 2023Updated 2 years ago
- ☆20Apr 17, 2023Updated 2 years ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆87Sep 12, 2025Updated 5 months ago
- ☆21Oct 6, 2023Updated 2 years ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆20Jul 24, 2023Updated 2 years ago
- ☆130Aug 18, 2022Updated 3 years ago
- ☆20Jul 6, 2023Updated 2 years ago
- Code for the Ask4Help project☆22Nov 24, 2022Updated 3 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 10 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52May 19, 2023Updated 2 years ago
- Hierarchical State Space Models☆49Apr 12, 2024Updated last year
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Nov 30, 2022Updated 3 years ago