google-deepmind / exedec
☆11Updated 10 months ago
Alternatives and similar repositories for exedec:
Users that are interested in exedec are comparing it to the libraries listed below
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Updated last year
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆24Updated last year
- ☆12Updated last year
- ☆15Updated 10 months ago
- ☆21Updated last month
- ☆14Updated 4 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 6 months ago
- Rewarded soups official implementation☆55Updated last year
- ☆14Updated 11 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- ☆33Updated last year
- ☆29Updated 4 months ago
- ☆21Updated 5 months ago
- ☆22Updated 11 months ago
- ☆80Updated 7 months ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆9Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 9 months ago
- ZeroC is a neuro-symbolic method that trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…☆30Updated last year
- ☆29Updated last year
- ☆61Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆118Updated 6 months ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆15Updated 4 months ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆16Updated 9 months ago
- Code for Neural Execution Engines: Learning to Execute Subroutines☆17Updated 4 years ago
- ☆37Updated last year
- ☆30Updated 4 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- ☆30Updated 3 months ago
- Sparse Autoencoder Training Library☆43Updated 4 months ago
- ☆11Updated 2 years ago