samiranrl / Carrom_rl
A Pygame+Pymunk Carrom Simulation Testbed for reinforcement learning. [CS747][ Foundations of Intelligent and Learning Agents]
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Carrom_rl
- Direct C++ Interface to PyTorch☆80Updated 6 years ago
- Code for the blog post on few-shot classification via task representation and communication.☆18Updated 7 years ago
- ☆32Updated 8 years ago
- TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient.☆44Updated 6 years ago
- ☆15Updated 7 years ago
- Model-Free Episodic Control☆15Updated 7 years ago
- Deterministic Policy Gradient using torch7☆44Updated 8 years ago
- Convert a Caffe Model to a Theano Model☆11Updated 9 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Updated 7 years ago
- Models built with TensorFlow☆25Updated 5 years ago
- Our NIPS 2017: Learning to Run source code☆56Updated last year
- PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)☆79Updated 7 years ago
- Backpropagation training of neural networks with Hebbian plastic connections☆30Updated 3 years ago
- ☆20Updated 7 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- ☆29Updated 7 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 5 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆22Updated 7 years ago
- Implementation of Residual Learning with Stochastic Depth http://arxiv.org/pdf/1603.09382v2.pdf☆10Updated 8 years ago
- OpenAI Gym environment for DART robotics simulator.☆22Updated 6 years ago
- Playing Atari games with TensorFlow implementation of Asynchronous Deep Q-Learning☆43Updated 6 years ago
- ☆22Updated 7 years ago
- A Caffe implementation of http://arxiv.org/abs/1512.07928☆40Updated 8 years ago
- This is an implimentation of Value Iteration Networks (NIPS2016 best paper) in keras☆18Updated 6 years ago
- Distributed A3C☆34Updated 6 years ago
- some RL algorithms☆19Updated 7 years ago
- reinfore learning tool box, contains trpo, a3c algorithm for continous action space☆43Updated 6 years ago