declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆25Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for resta
- ☆33Updated last year
- ☆24Updated last year
- ☆40Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆59Updated 8 months ago
- ☆78Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆46Updated 2 years ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆62Updated 2 years ago
- ☆44Updated 2 months ago
- Methods and evaluation for aligning language models temporally☆24Updated 8 months ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆97Updated last year
- Data Valuation on In-Context Examples (ACL23)☆23Updated 3 weeks ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆54Updated 10 months ago
- ☆26Updated 6 months ago
- ☆24Updated 6 months ago
- ☆23Updated last month
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆45Updated 4 months ago
- We have released the code and demo program required for LLM with self-verification☆47Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆63Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆92Updated last year
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Updated last month
- ☆16Updated last year
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 2 years ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆24Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated last year
- Interpretable unified language safety checking with large language models☆30Updated last year
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated 2 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆105Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆41Updated 2 years ago
- ☆48Updated last year
- Personality Alignment of Language Models☆18Updated 2 months ago