rddy / mimi
Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"
☆23Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for mimi
- Evaluating different engineering tricks that make RL work☆15Updated 3 years ago
- ☆15Updated 2 years ago
- Shared MuJoCo simulation scenes and assets for ROBEL environments.☆12Updated 4 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 3 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16Updated last year
- Gradient-based constrained optimization for JAX☆26Updated 2 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation"☆11Updated 3 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆23Updated 3 years ago
- Cross-Domain Imitation Learning via Optimal Transport☆23Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- ☆22Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆20Updated 6 months ago
- GPT implementation in Flax☆18Updated 2 years ago
- Code for "Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems"☆50Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 3 months ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆27Updated 4 years ago
- Cellular Automata Reinforcement Learning Environment.☆9Updated 3 months ago
- Implementations of Curious Replay for model-based adaptation.☆36Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated last month
- speed-running solving robot manipulation tasks☆18Updated last week
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Updated 3 years ago
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024)☆15Updated 4 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆63Updated 2 months ago
- Automatically generate simple meta-learning tasks from a very large space☆15Updated last year
- Official code for "Task-Embedded Control Networks for Few-Shot Imitation Learning".☆44Updated 4 years ago
- Code for Continual Learning of Control Primitives☆18Updated 4 years ago
- Intepretability method to find what navigation agents learn☆18Updated 2 years ago
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Updated last year
- [ICLR 2021] Beyond Categorical Label Representations for Image Classification☆25Updated 2 years ago