TMats / surveyLinks
Summary of Paper Survey
☆15Updated 5 years ago
Alternatives and similar repositories for survey
Users that are interested in survey are comparing it to the libraries listed below
Sorting:
- ☆16Updated 8 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Updated 3 years ago
- Implementation of Grounded Language Learning in a 3D Simulated World (DeepMind)☆34Updated 7 years ago
- Supporting code for the paper 'Learning to generate classifiers'.☆18Updated 7 years ago
- This is a self-contained memory module for the Dynamic Kanerva Machine, as reported in the NIPS 2018 paper: Learning Attractor Dynamics f…☆43Updated 6 years ago
- Implementation of Neural Episodic Control in Tensorflow☆27Updated 6 years ago
- AI論文読みメモ☆26Updated 8 years ago
- SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆54Updated 7 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆23Updated 7 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code for "Boosted Generative Models", AAAI 2018.☆20Updated 7 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Updated 7 years ago
- DQN implementation in Keras + TensorFlow + OpenAI Gym☆46Updated 7 years ago
- A fast implementation of Neural Image Caption by Chainer☆16Updated 6 years ago
- Leaning hard attention model by policy gradient with rewards based on active inference.☆23Updated 7 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Keras implementation of Curiosity-driven Exploration by Self-supervised Prediction☆8Updated 7 years ago
- ☆17Updated 7 years ago
- Jupyter notebooks for Chainer hands-on☆24Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 8 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"☆24Updated 7 years ago
- ☆56Updated 6 years ago
- Scripts to generate the CoDraw and i-CLEVR datasets used for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Gen…☆39Updated 2 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10Updated 7 years ago
- Code for the COG dataset and network☆43Updated 6 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 5 years ago