rycolab / kl-rb
This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"
☆9Updated last month
Alternatives and similar repositories for kl-rb
Users that are interested in kl-rb are comparing it to the libraries listed below
Sorting:
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆16Updated 5 months ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 2 years ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆12Updated 2 years ago
- Conformal Language Modeling☆29Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆76Updated last year
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 3 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated 2 years ago
- Simple and scalable tools for data-driven pretraining data selection.☆23Updated 3 months ago
- Extending Conformal Prediction to LLMs☆66Updated 10 months ago
- ☆16Updated 5 months ago
- Few-shot Learning with Auxiliary Data☆27Updated last year
- ☆19Updated 10 months ago
- ☆36Updated 2 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆33Updated 3 years ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆106Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- ☆60Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆38Updated 2 years ago
- N/A☆18Updated 2 years ago
- ☆26Updated last year
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆53Updated 2 years ago
- Code for Residual Energy-Based Models for Text Generation in PyTorch.☆23Updated 4 years ago
- Universal Neurons in GPT2 Language Models☆29Updated 11 months ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆17Updated 2 years ago
- ☆34Updated 5 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆102Updated last year
- ☆40Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Updated 2 years ago
- ☆29Updated last year
- Learning adapter weights from task descriptions☆17Updated last year