rddy / mimi
Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"
☆25Updated 2 years ago
Alternatives and similar repositories for mimi:
Users that are interested in mimi are comparing it to the libraries listed below
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆64Updated 4 months ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.☆26Updated 3 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆23Updated 3 years ago
- Shared MuJoCo simulation scenes and assets for ROBEL environments.☆12Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆29Updated 4 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 3 years ago
- Code for "Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems"☆55Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- ☆32Updated 9 months ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation"☆12Updated 3 years ago
- ☆15Updated 2 years ago
- ☆28Updated 2 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 3 years ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 months ago
- Code for Continual Learning of Control Primitives☆18Updated 4 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆21Updated 8 months ago
- Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning☆24Updated 2 years ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆39Updated last year
- Implementation of the Belief State Encoder / Decoder in the new breakthrough robotics paper from ETH Zürich☆65Updated 2 years ago
- ☆16Updated 3 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- OpenAI gym environment for evolving morphologies of 2D virtual creatures.☆33Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆19Updated 2 months ago
- ☆27Updated 4 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆36Updated last year
- Intepretability method to find what navigation agents learn☆17Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year