coding examples to Intro to RL
☆13Apr 30, 2018Updated 7 years ago
Alternatives and similar repositories for intro-to-rl
Users that are interested in intro-to-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Simple, small, fully-connected Python version of NeoRL☆11Jan 29, 2016Updated 10 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆11Aug 11, 2016Updated 9 years ago
- TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN)☆10Mar 1, 2018Updated 8 years ago
- An attempt at implementing ideas in "Learning to Transduce with Unbounded Memory" (http://arxiv.org/abs/1506.02516)☆11Jul 27, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- Using Pilco algorithm to find a controller for few robotic problems☆44Jul 31, 2015Updated 10 years ago
- ☆11Jul 24, 2025Updated 8 months ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 8 years ago
- Code for a generative controller for the AI Gym cartpole task☆15Feb 22, 2017Updated 9 years ago
- Python3 reimplementation of Wissner-Gross & Freer, 2013☆15Dec 18, 2025Updated 3 months ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)☆11Feb 9, 2023Updated 3 years ago
- Example of a Variational-Autoencoder using Theano blocks☆12Jun 16, 2015Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Adaptive Memory Prediction Framework☆15Apr 19, 2015Updated 10 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 9 years ago
- Implementation of Deep Variational Bayes Filter☆13Aug 9, 2019Updated 6 years ago
- Code from posts at AlgorthmicAlley.com☆14Nov 27, 2019Updated 6 years ago
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.☆12Jan 14, 2017Updated 9 years ago
- Hierarchical Encoder Decoder for Dialog Modelling☆16May 20, 2015Updated 10 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆54Jul 25, 2016Updated 9 years ago
- Memory Augmented Neural Networks (Pytorch)☆14Sep 2, 2018Updated 7 years ago
- ☆24Oct 22, 2015Updated 10 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Bootcamp held for Spring 17.☆10Mar 14, 2017Updated 9 years ago
- ☆16Jan 13, 2023Updated 3 years ago
- Implementation of condnets☆16Apr 21, 2016Updated 9 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Jul 23, 2018Updated 7 years ago
- generative models for speech☆20Jul 4, 2016Updated 9 years ago
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0☆14Mar 19, 2018Updated 8 years ago
- Backprop training of recurrent neural networks with Hebbian plastic connections☆20Jun 30, 2021Updated 4 years ago
- Comparison between Sarsa and Q-Learning algorithms on risk handling☆17Jul 10, 2017Updated 8 years ago
- Transporter implementation in PyTorch☆20Jul 24, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Source code for the following paper(arXiv link): Improved Actor Relation Graph based Group Activity Recognition Zijian Kuang, Xinran Tie☆15Jan 19, 2022Updated 4 years ago
- General experiments on Vanilla RNN and LSTM in Theano.☆16Aug 23, 2015Updated 10 years ago
- Train an RL agent to play multiple Atari games at once☆69Jun 6, 2016Updated 9 years ago
- A Pygame+Pymunk Carrom Simulation Testbed for reinforcement learning. [CS747][ Foundations of Intelligent and Learning Agents]☆15Jun 24, 2019Updated 6 years ago
- Projective Simulation☆18Jul 27, 2018Updated 7 years ago
- Generative Sparse Distributed Representations, a fast generative model written in Python (Original C++ implementation https://github.com/…☆24Jul 24, 2016Updated 9 years ago
- Lasagne / Theano tutorials for Nvidia Deep Learning Summercamp 2016☆26Sep 29, 2016Updated 9 years ago