nishantsubramani / steering_vectors
Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings
☆9 Updated 2 years ago
Related projects:
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 2 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆47 Updated 11 months ago
- Evaluating Machines by their Real-World Language Use ☆33 Updated last year
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models' ☆17 Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). ☆20 Updated 2 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer" ☆17 Updated 2 years ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models" ☆48 Updated last year
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding ☆18 Updated last year
- Exploring Few-Shot Adaptation of Language Models with Tables ☆23 Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆26 Updated 9 months ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ☆75 Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆136 Updated last year
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model. ☆11 Updated 2 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees ☆23 Updated last year
- Codebase for public release of the plug-and-blend framework. ☆22 Updated 2 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆53 Updated last year
- Learning to Model Editing Processes ☆26 Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 Updated last year
- Pretraining summarization models using a corpus of nonsense ☆13 Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 2 years ago
- Embedding Recycling for Language models ☆38 Updated last year
- Automatic metrics for GEM tasks ☆61 Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆16 Updated last year
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization". ☆32 Updated 2 years ago