Proximal Policy Optimization (PPO) algorithm for Contra
☆143Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for Contra-PPO-pytorch
Users that are interested in Contra-PPO-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Optimization (PPO) algorithm for Super Mario Bros☆1,273Jul 24, 2021Updated 4 years ago
- Character-level CNN for text classification☆56Dec 26, 2021Updated 4 years ago
- Reinforcement Learning attempts to beat Contra 3 for the SNES☆14Feb 16, 2019Updated 7 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 6 years ago
- ☆33Nov 11, 2013Updated 12 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis☆31Jan 9, 2019Updated 7 years ago
- Very deep CNN for text classification☆37Dec 26, 2021Updated 4 years ago
- Curiosity-driven Exploration by Self-supervised Prediction for Street Fighter III Third Strike☆164Nov 11, 2019Updated 6 years ago
- Implementation of SNAIL(A Simple Neural Attentive Meta-Learner) with Gluon☆12Feb 22, 2019Updated 7 years ago
- This project applies multiple deep learning models to the problem of restoring diacritical marks to sentences in Vietnamese.☆26Nov 13, 2018Updated 7 years ago
- Versions of hybrid pso algorithms for engineering optimization☆10Dec 21, 2017Updated 8 years ago
- This is a repository for the Duke University Cloud Computing course project on Serveless Data Engineering Pipeline. For this project, I r…☆21Apr 8, 2021Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Oct 4, 2020Updated 5 years ago
- Benchmark dataset for maritime multi-sensor, multi-target tracking☆14Apr 19, 2022Updated 3 years ago
- Xây dựng chương trình xây dựng bộ stopwords tiếng việt dựa trên IDF sử dụng scikit-learn☆22Apr 23, 2019Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Dec 8, 2022Updated 3 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 7 years ago
- runs the emulator that communicates with the weplay io backend☆25Feb 22, 2017Updated 9 years ago
- A model predictive control based voltage source inverter☆11Jan 11, 2020Updated 6 years ago
- Vision-Based Navigation for Auto-Docking☆14Apr 21, 2021Updated 4 years ago
- docker scripts to build and run a minimal version of TDengine☆10Jul 17, 2019Updated 6 years ago
- This submission demonstrates modeling and simulation of a Two-Zone MVDC electric ship in Simscape Electrical, and considers modeling con…☆10Dec 14, 2022Updated 3 years ago
- Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros☆1,107Apr 28, 2024Updated last year
- The goal of this design is to use the PYNQ-Z2 development board to design a general convolution neural network accelerator. And through r…☆11Sep 30, 2020Updated 5 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- Average-Reward Reinforcement Learning with Trust Region Methods☆11Oct 17, 2022Updated 3 years ago
- A template for using Leptos ssr + server functions in a Cloudflare worker☆18Nov 2, 2024Updated last year
- visual studio code extension for TDengine☆10Mar 21, 2023Updated 3 years ago
- Web RT☆13May 25, 2017Updated 8 years ago
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- A collection of Fault Diagnosis python codes☆10Mar 13, 2022Updated 4 years ago
- Webots simulation environment and a vision-based autonomous docking algorithm for robotic vessels with a novel latching system.☆15Oct 8, 2024Updated last year
- Various reinforcement learning algorithms written in Jax + Flax☆26Jun 24, 2023Updated 2 years ago
- Incremental Passive Fault-Tolerant Control for Quadrotors With up to Three Successive Rotor Failures☆12May 10, 2025Updated 10 months ago
- Relevant codes of the paper ``Adaptive Parameterized Model Predictive Control Based on Reinforcement Learning: A Synthesis Framework"☆16Mar 4, 2024Updated 2 years ago
- Compare PyTorch models from MATLAB using co-execution☆14Sep 29, 2023Updated 2 years ago
- Word2vec for Truyen Kieu☆18Jan 1, 2024Updated 2 years ago
- Based in my last research paper - 2021☆10Dec 9, 2023Updated 2 years ago