hendrycks / jiminy-cricket
Jiminy Cricket Environment (NeurIPS 2021)
☆24Updated 2 years ago
Alternatives and similar repositories for jiminy-cricket:
Users that are interested in jiminy-cricket are comparing it to the libraries listed below
- ☆38Updated 3 years ago
- ☆55Updated 4 years ago
- ☆34Updated 3 years ago
- ☆20Updated 4 months ago
- Automatically Composing Representation Transformations as a Means for Generalization☆24Updated 5 years ago
- Redwood Research's transformer interpretability tools☆13Updated 2 years ago
- ☆24Updated 4 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆42Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆20Updated last year
- Scaling scaling laws with board games.☆45Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆94Updated last year
- Distilling Model Failures as Directions in Latent Space☆46Updated last year
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- A centralized place for deep thinking code and experiments☆79Updated last year
- A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.☆35Updated 2 weeks ago
- ☆24Updated 5 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?☆36Updated 2 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- ☆22Updated 2 years ago
- Measuring compositionality in representation learning☆72Updated 5 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆57Updated last year
- ☆58Updated 3 years ago
- ☆62Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆28Updated last year
- A library for efficient patching and automatic circuit discovery.☆48Updated 2 months ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago