vcadillog / PPO-Mario-Bros-Tensorflow-2
A modular implementation of Proximal Policy Optimization in TensorFlow 2, using eager execution, for the Super Mario Bros environment.
☆21Nov 6, 2019Updated 6 years ago
Alternatives and similar repositories for PPO-Mario-Bros-Tensorflow-2
Users who are interested in PPO-Mario-Bros-Tensorflow-2 are comparing it to the repositories listed below
- The project is advised by Professor Robert Engle in his Financial Econometrics PhD course. I made a comparison between the performance of d…☆10Sep 14, 2018Updated 7 years ago
- ☆16Feb 4, 2026Updated last week
- ☆13May 14, 2017Updated 8 years ago
- Unity extensions for Mono☆12Jun 11, 2018Updated 7 years ago
- URDF description of the JVRC humanoid model☆15Jan 9, 2025Updated last year
- Model-free policy gradient algorithm for LQR☆10Apr 8, 2020Updated 5 years ago
- A networked lobby system for FishNet☆11Mar 16, 2023Updated 2 years ago
- Portfolio Optimisation is a fundamental problem in Financial Mathematics. The objective of this project is to explore the applicability of…☆13Nov 10, 2020Updated 5 years ago
- ☆10Sep 14, 2016Updated 9 years ago
- Unity iOS Sample project☆11Aug 29, 2014Updated 11 years ago
- JVRC1 model files for MuJoCo☆10Apr 8, 2025Updated 10 months ago
- Manifold-based-algorithm to solve problems with constant modulus constraints.☆15Jan 2, 2020Updated 6 years ago
- Uses TensorFlow 2 to implement PPO for playing Atari games☆13Oct 25, 2019Updated 6 years ago
- Official Implementation of SFM and the baselines in Jax.☆20May 31, 2025Updated 8 months ago
- Implementation of various reinforcement learning algorithms in examples obtained from the book "Reinforcement Learning: An Introduction, …☆10Feb 7, 2022Updated 4 years ago
- Temporally Correlated Episodic Reinforcement Learning, ICLR 24☆12Apr 8, 2024Updated last year
- Research project on applying deep reinforcement learning to perform financial market predictions. A competitive market maker.☆13Dec 8, 2022Updated 3 years ago
- 2 algorithms of optimal trade execution: 1) Dynamic Programming 2) Frank-Wolfe Algorithm (Python & C++)☆18Dec 11, 2019Updated 6 years ago
- RL implementation in course 'Control for Robotics' at U of T, includes Generalized Policy Iteration, Monte Carlo and Q-Learning in MATLAB☆14Jul 16, 2022Updated 3 years ago
- Code accompanying the latent-action-priors paper.☆12Mar 5, 2025Updated 11 months ago
- Code implementations (MATLAB version) for each chapter of the book "Reinforcement Learning, Second Edition" (by Richard S. Sutton)☆16Nov 6, 2019Updated 6 years ago
- A development guide for building custom robot assemblies in Solidworks, converting them to URDF, importing to pybullet environment, and s…☆12Aug 25, 2020Updated 5 years ago
- Simple data analysis of Shanghai second-hand housing data collected by Lianjia, plus house price prediction using linear regression☆17Jan 15, 2020Updated 6 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- VPIN calculation and backtesting☆14Aug 16, 2019Updated 6 years ago
- A reinforcement learning implementation for Assetto Corsa. (Bachelor project TCS 2023)☆18Jan 24, 2024Updated 2 years ago
- Python scripts for trajectory optimization method (iterative LQG)☆11Feb 17, 2016Updated 9 years ago
- A* Algorithm in Julia☆14Jan 4, 2026Updated last month
- A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input☆13Feb 28, 2019Updated 6 years ago
- Implementation in Matlab☆13Jul 9, 2020Updated 5 years ago
- Bayesian Estimation of the GARCH(1,1) Model with Student-t Innovations☆16May 16, 2021Updated 4 years ago
- Model predictive path integral control in jax☆16Feb 8, 2026Updated last week
- SDP Code for Distributionally Robust Optimization Technique☆11Aug 25, 2018Updated 7 years ago
- Reinforcement Learning (RL) Course in MATLAB with exercises and solutions☆18Jul 30, 2021Updated 4 years ago
- Collection of reinforcement learning algorithm implementations with TensorFlow 2☆14Sep 28, 2024Updated last year
- some RPG save☆16Nov 2, 2025Updated 3 months ago
- GentleHumanoid: Whole Body Motion Tracking with Compliance - Training☆39Dec 17, 2025Updated last month
- This is a repository of PyTorch implementations of DQN and its variants, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- A PyTorch implementation of the paper "Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images", NIPS, 2015☆15Sep 30, 2019Updated 6 years ago