☆24Oct 22, 2015Updated 10 years ago
Alternatives and similar repositories for deeprlhw2
Users that are interested in deeprlhw2 are comparing it to the libraries listed below
Sorting:
- ☆10Mar 10, 2021Updated 4 years ago
- Optimal Transport and Optimization related experiments.☆10Jul 22, 2018Updated 7 years ago
- A curated lists of self-taught materials including research blogs☆16Dec 12, 2016Updated 9 years ago
- Repo containing to-dos and instructions for DRL in POMDPs.jl☆11Jun 21, 2016Updated 9 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Jul 17, 2019Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- This repository contains the game bubble shooter as a gym environment. Based on: https://github.com/justinmeister/bubbleshooter☆17Mar 30, 2020Updated 5 years ago
- ☆25Apr 16, 2024Updated last year
- ☆19Apr 25, 2016Updated 9 years ago
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆53Feb 16, 2020Updated 6 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Feb 15, 2023Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Implementation of TRPO and related algorithms☆647May 20, 2018Updated 7 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆32Apr 15, 2019Updated 6 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Aug 1, 2017Updated 8 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- NeurIPS 2018: AI for Prosthetics Challenge – 3rd place solution☆32Oct 15, 2019Updated 6 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- ☆31Nov 21, 2018Updated 7 years ago
- ☆10Feb 13, 2025Updated last year
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 8 years ago
- Collaborative Deep Reinforcement Learning☆32Jul 29, 2017Updated 8 years ago
- Controlling steering wheel of a car in simulator☆27Jan 8, 2017Updated 9 years ago
- Assignments for CS294-112.☆1,651Mar 24, 2023Updated 2 years ago
- A parallel version of Trust Region Policy Optimization☆65Mar 6, 2017Updated 8 years ago
- Add-on package to gym, to record sequences of actions, observations, and rewards☆75Apr 2, 2023Updated 2 years ago
- Multitask Learning with Pretrained Transformers☆40Mar 20, 2021Updated 4 years ago
- A Python program, running as an independent process, that provides a 'proxy like' service for experiment runtimes ( psychopy ) and device…☆19May 8, 2013Updated 12 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- C++ library to work with Iso8583 messages☆10Sep 22, 2018Updated 7 years ago
- enuSpace plugin for Tensorflow (graphical logic block, flow programming)☆11Feb 6, 2020Updated 6 years ago
- ☆20May 24, 2025Updated 9 months ago
- Repository for Manning Twitch session about building and deploying APIs with Python☆12Jul 19, 2021Updated 4 years ago
- Update metadata (titles, authors, publications, etc.) of selected entries in Zotero☆11Aug 19, 2024Updated last year
- An open source project on estimating train delays in India.☆11Oct 29, 2018Updated 7 years ago
- Focused Crawler for VT's CTRNet☆10May 13, 2013Updated 12 years ago
- Tool for technical analysis of financial data about companies indexed on the stockmarket using machine learning☆11Sep 6, 2017Updated 8 years ago
- ☆11May 13, 2021Updated 4 years ago