This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https://arxiv.org/abs/2007.03328
☆19Oct 5, 2021Updated 4 years ago
Alternatives and similar repositories for ppo_D
Users that are interested in ppo_D are comparing it to the libraries listed below
Sorting:
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- Testing Spark Structured Streaming anf Kafka with real data from traffic sensors☆17Nov 11, 2022Updated 3 years ago
- 뇌를 자극하는 시스템 프로그래밍☆13Mar 2, 2023Updated 3 years ago
- ☆11Sep 18, 2025Updated 5 months ago
- RL Algorithms for Visual Continuous Control☆36May 31, 2023Updated 2 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- A feasible digital twin solution prototype for industry 4.0, which will be demonstrated for offshore wind farms through lab-based physica…☆10Jun 6, 2022Updated 3 years ago
- Hybrid Car Model MATLAB Simulink Simscape☆15Jul 27, 2023Updated 2 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.☆35Feb 13, 2022Updated 4 years ago
- ☆11Apr 22, 2022Updated 3 years ago
- Model-Free Adaptive Control☆16Dec 11, 2020Updated 5 years ago
- Intel Atom D2550 Embedded Motherboard☆13Dec 26, 2018Updated 7 years ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- ☆14May 25, 2022Updated 3 years ago
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.☆11Feb 25, 2025Updated last year
- Hungarian algorithm for linear sum assignment. Works for square and rectangular matrices.☆10May 16, 2017Updated 8 years ago
- ☆40Oct 30, 2021Updated 4 years ago
- A review of the most popular topic modeling techniques, featuring hands-on tutorials.☆12Apr 29, 2025Updated 10 months ago
- ☆12Oct 15, 2024Updated last year
- Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10☆12Apr 6, 2022Updated 3 years ago
- Automated pipeline for energy consumption forecasting across Europe using Azure cloud and Databricks.☆12Jul 7, 2023Updated 2 years ago
- Multivariate Time Series Data usable for Time Series Segmentation and Time Series Classification. Each sample represents the multi-phased…☆11Apr 20, 2024Updated last year
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 2 years ago
- numerically stable dynamic bicycle model for discrete time control, accepted by IEEE IV2021 workshop☆10Sep 5, 2022Updated 3 years ago
- Implementing MPC for low speed parking lot scenarios (EECS561 Final Project)☆10Jul 3, 2019Updated 6 years ago
- Traction adaptive motion planning using sampling augmented adaptive RTI☆12Jun 6, 2021Updated 4 years ago
- These datasets are collected for the GBIC project to conduct research about indoor human thermal comfort. The GBIC research project propo…☆12Aug 5, 2020Updated 5 years ago
- [CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification☆11Jul 12, 2024Updated last year
- ☆10Feb 18, 2022Updated 4 years ago
- This repository provides codes in MATLAB for computing data-driven backward reachable sets and set-theoretic model predictive control (ST…☆10Aug 2, 2023Updated 2 years ago
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 4 years ago
- The ROS interface as well as the Python packages for ProSeCo Planning☆10Jun 17, 2024Updated last year
- Codebase of paper "Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms" publ…☆12Jul 13, 2023Updated 2 years ago
- ☆16Apr 28, 2023Updated 2 years ago
- highlight bot is a platform that uses AI to automagically create highlight videos from live FIFA 20 gameplay when goals are scored. It's …☆11Dec 8, 2022Updated 3 years ago
- Sample for training an agent which mimics a cab driver to gain maximum profits by picking the correct rides. The agent is trained using d…☆10Jul 5, 2022Updated 3 years ago
- Instantly fix problems with ChatGPT AI. Use ChatGPT and GPT-4 AI tools to find one-click 'lightbulb menu' solutions to problems in your c…☆12Mar 26, 2023Updated 2 years ago
- This project developed a light-duty CAV lane-changing (LC) model with four components: car following (CF), mandatory and incentive-based …☆12Sep 21, 2020Updated 5 years ago