rddy / mimiLinks

Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"

☆25

Alternatives and similar repositories for mimi

Users that are interested in mimi are comparing it to the libraries listed below

Sorting:

Miffyli / rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
☆15Updated 4 years ago
kampta / PatchGame
PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021
☆23Updated 4 years ago
shyamsn97 / controllable-ncas
Code for "Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems"
☆56Updated 3 years ago
yuqingd / cusp
☆15Updated 2 years ago
google-deepmind / affordances_option_models
☆23Updated 3 years ago
riveSunder / carle
Cellular Automata Reinforcement Learning Environment.
☆9Updated 11 months ago
ElisevanderPol / PRAE
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
☆30Updated 5 years ago
EleutherAGI / summarisation
The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…
☆12Updated 4 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated 11 months ago
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
shoaibahmed / metadata_archaeology
Official code for the paper: "Metadata Archaeology"
☆19Updated 2 years ago
Kajiyu / kanerva_machine
The implementation of "The Kanerva Machine" with Pytorch and Pyro
☆12Updated 7 years ago
vikashplus / unitree_sim
MuJoCo models for Unitree Robots
☆12Updated 3 years ago
SamuelSchmidgall / EvolutionarySelfReplication
Produce intelligence by means of natural selection without objective/reward optimization
☆14Updated 3 years ago
icaros-usc / MarioGAN-LSI
An experimental setup for running quality diversity algorithms on GAN latent spaces.
☆22Updated 5 years ago
enlite-ai / maze_smaac
Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze
☆10Updated 3 years ago
PAL-ML / PEARL_v1
☆30Updated 3 years ago
facebookresearch / cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
☆29Updated 2 years ago
lucidrains / metaformer-gpt
Implementation of Metaformer, but in an autoregressive manner
☆25Updated 3 years ago
siddharthverma314 / clcp-neurips-2020
Code for Continual Learning of Control Primitives
☆18Updated 4 years ago
enajx / HyperNCA
☆39Updated 3 years ago
schrum2 / GameGAN
Interactive GAN evolution of Mario and Zelda levels.
☆54Updated last year
ThomasMiconi / Meta-Task-Generator
Automatically generate simple meta-learning tasks from a very large space
☆15Updated last year
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
ToruOwO / mimex
PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"
☆16Updated 2 years ago
kvfrans / powderworld
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆68Updated 10 months ago
HumanCompatibleAI / deep-rlsp
Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.
☆26Updated 4 years ago
crowsonkb / dice-mc
DiCE: The Infinitely Differentiable Monte-Carlo Estimator
☆31Updated last year
IndustAI / learning-group-structure
Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"
☆15Updated 2 years ago
pbaylies / clustering-laion400m
Script and models for clustering LAION-400m CLIP embeddings.
☆26Updated 3 years ago