andy-psai / MountainCar_ActorCriticView external linksLinks
TD Advantage Actor-Critic RL algorithm
☆15Mar 19, 2019Updated 6 years ago
Alternatives and similar repositories for MountainCar_ActorCritic
Users that are interested in MountainCar_ActorCritic are comparing it to the libraries listed below
Sorting:
- Implementations of solutions to the continuous mountain car problem. Using OpenAI Gym and Tensorflow 1.1.☆11Jan 29, 2018Updated 8 years ago
- ☆11Oct 2, 2020Updated 5 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- Implimenting DDPG Algorithm in Tensorflow-2.0☆10Mar 25, 2023Updated 2 years ago
- ☆15Feb 23, 2025Updated 11 months ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0☆12May 13, 2020Updated 5 years ago
- Holds docker images and run scripts for BobbleBot simulation environment.☆12May 18, 2019Updated 6 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago
- Conflict avoidance algorithm for unmanned aircraft traffic management☆10May 30, 2017Updated 8 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Procedural object generation for robotic manipulation☆11Oct 6, 2018Updated 7 years ago
- A step-by-step fully automated script to stand up a single-server instance of Open edX release Ginkgo.1 running on an AWS EC2 R3.Large in…☆18Feb 13, 2018Updated 8 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago
- Word Embeddings, RNNs, Language Modelling, LSTMs, Sequence to sequence, Attention, Transformers, BERT☆19Jun 10, 2021Updated 4 years ago
- Leap Motion SDK - Python 3 module builder☆17Aug 15, 2020Updated 5 years ago
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Mar 21, 2024Updated last year
- A detailed conversion of a C++ project to Python using pybind11☆18Oct 1, 2021Updated 4 years ago
- a place for randos online☆14Jul 17, 2020Updated 5 years ago
- This repo contains various data structures and algorithms problems solved using Python☆18Nov 21, 2022Updated 3 years ago
- Python implementation of algorithm proposed in paper "Autonomous On-Demand Free Flight Operations in Urban Air Mobility using Monte Carlo…☆16May 10, 2021Updated 4 years ago
- "SinGAN : Learning a Generative Model from a Single Natural Image" in TensorFlow 2☆17Oct 9, 2020Updated 5 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Sep 11, 2023Updated 2 years ago
- ☆15Sep 14, 2020Updated 5 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Oct 6, 2021Updated 4 years ago
- Integrate torsional springs into your Gazebo simulation.☆19Jan 30, 2019Updated 7 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Solution to the OpenAI Gym environment of the MountainCar through Deep Q-Learning☆23Dec 2, 2018Updated 7 years ago
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin…☆21May 26, 2022Updated 3 years ago
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- use tensorflow to implement the MADDPG(simple_tag)☆18Jan 7, 2018Updated 8 years ago
- ForgER algorithm☆23Oct 3, 2022Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- LSTM based human activity recognition using smart phone sensor dataset☆22Jun 17, 2017Updated 8 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 2 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Jan 31, 2020Updated 6 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Emergent collective intelligence from massive-agent cooperation and competition☆27Jan 9, 2023Updated 3 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago