karthikncode / Grounded-RL-TransferLinks
☆13Updated 6 years ago
Alternatives and similar repositories for Grounded-RL-Transfer
Users that are interested in Grounded-RL-Transfer are comparing it to the libraries listed below
Sorting:
- Solving reinforcement learning tasks which require language and vision☆33Updated 2 years ago
- BabyAI++: Towards Grounded language Learning beyond Memorization, ICLR BeTR-RL 2020☆26Updated 5 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Learning with latent language☆51Updated 4 years ago
- PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper☆26Updated 3 years ago
- SeqGAN but with more bells and whistles☆24Updated 7 years ago
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆73Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 4 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆135Updated last year
- Dataset and documentation for paper on explaining solutions to physical reasoning tasks (ESPRIT))☆21Updated 3 months ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆116Updated 5 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 5 years ago
- Systematic generalization test for CLEVR☆15Updated 5 years ago
- Measuring compositionality in representation learning☆73Updated 6 years ago
- Generalised UDRL☆37Updated 3 years ago
- Z Forcing: Training Stochastic RNN's, NIPS'17☆32Updated 7 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Phy-Q: A Testbed for Physical Reasoning☆44Updated last year
- Variational Reinforcement Learning☆16Updated last year
- On the pitfalls of measuring emergent communication☆34Updated 6 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Updated 4 years ago
- ☆54Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- ☆25Updated 6 years ago
- Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…☆32Updated 4 years ago
- E2C implementation in PyTorch☆43Updated 8 years ago
- ☆31Updated 6 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated last year
- Stein Variational Policy Gradient for REINFORCE☆18Updated 8 years ago