dongxinshuai / RIFT-NeurIPS2021
☆11Updated 3 years ago
Alternatives and similar repositories for RIFT-NeurIPS2021
Users interested in RIFT-NeurIPS2021 are comparing it to the repositories listed below.
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- ☆24Updated 4 years ago
- ☆25Updated 2 weeks ago
- ☆29Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆66Updated 2 years ago
- ☆44Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆44Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆38Updated 2 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆75Updated last year
- Implementation for Variational Information Bottleneck for Effective Low-resource Fine-tuning, ICLR 2021☆40Updated 4 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- ☆24Updated 4 years ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- ☆17Updated 4 years ago
- ☆107Updated 3 years ago
- Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Const…☆62Updated last year
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Updated last year
- This is the official repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆102Updated 2 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆23Updated 9 months ago
- tianlu-wang / Identifying-and-Mitigating-Spurious-Correlations-for-Improving-Robustness-in-NLP-Models: NAACL 2022 Findings☆15Updated 3 years ago
- Augmenting Statistical Models with Natural Language Parameters☆27Updated 9 months ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆17Updated 3 months ago
- ☆35Updated 6 months ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆21Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- ☆21Updated 2 years ago
- Codebase for running (conditional) probing experiments☆22Updated 2 years ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆17Updated last year
- ☆26Updated 4 years ago