rddy / mimi
Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"
☆25 · Updated 2 years ago
Alternatives and similar repositories for mimi
Users who are interested in mimi are comparing it to the repositories listed below:
- Evaluating different engineering tricks that make RL work ☆15 · Updated 3 years ago
- Code for the paper Task Agnostic Morphology Evolution. ☆20 · Updated 3 years ago
- Repository for "Toward Artificial Open-Ended Evolution within Lenia using Quality-Diversity" (ALIFE 2024). ☆22 · Updated 2 weeks ago
- ☆15 · Updated 2 years ago
- Paper on DexPilot ☆15 · Updated 5 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021. ☆26 · Updated 3 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 · Updated 4 years ago
- GPT implementation in Flax ☆18 · Updated 3 years ago
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021 ☆23 · Updated 3 years ago
- MuJoCo models for Unitree Robots ☆12 · Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆66 · Updated 7 months ago
- The intermediate goal of the project is to train a GPT-like architecture to learn to summarise Reddit posts from human preferences, as th… ☆12 · Updated 3 years ago
- Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation" ☆13 · Updated 3 years ago
- 'I didn’t want to imitate anybody. Any movement I knew, I didn’t want to use.' – Pina Bausch ☆43 · Updated last year
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has bee… ☆16 · Updated last year
- An implementation of the Augmented Random Search algorithm ☆14 · Updated 3 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 · Updated last year
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies" ☆21 · Updated 11 months ago
- Generalised UDRL ☆37 · Updated 2 years ago
- ☆28 · Updated 2 years ago
- ☆22 · Updated 3 years ago
- ☆23 · Updated 3 years ago
- ☆33 · Updated 7 months ago
- ☆21 · Updated 4 years ago
- ☆38 · Updated 2 years ago
- ☆27 · Updated 4 years ago
- PyTorch implementation of DARLA preprocessing models ☆11 · Updated 7 years ago
- A CLIP conditioned Decision Transformer. ☆22 · Updated 3 years ago
- Official repository of Action-Free Guide ☆11 · Updated 2 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" … ☆16 · Updated 10 months ago