google-deepmind / exedec
☆12Updated 11 months ago
Alternatives and similar repositories for exedec:
Users who are interested in exedec are comparing it to the repositories listed below
- ☆22Updated 2 months ago
- ☆13Updated last year
- ☆14Updated 5 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆18Updated 2 months ago
- ☆18Updated 9 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆10Updated 3 weeks ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆14Updated 7 months ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Updated 8 months ago
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆24Updated 2 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆29Updated 3 months ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆16Updated 5 months ago
- ☆17Updated 11 months ago
- ☆17Updated 9 months ago
- ☆25Updated 8 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 7 months ago
- ☆29Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 10 months ago
- A library for efficient patching and automatic circuit discovery.☆62Updated 2 months ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆17Updated 4 years ago
- Augmenting Statistical Models with Natural Language Parameters☆25Updated 7 months ago
- ☆41Updated last year
- Self-Supervised Alignment with Mutual Information☆16Updated 10 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 5 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆43Updated last week
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- ☆28Updated last month
- ☆27Updated 9 months ago
- Exploration of automated dataset selection approaches at large scales.☆38Updated last month
- ☆31Updated last year
- Rewarded soups official implementation☆58Updated last year