compsciencelab/ppo_D

This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https://arxiv.org/abs/2007.03328

☆19

Alternatives and similar repositories for ppo_D

Users who are interested in ppo_D are comparing it to the repositories listed below.

fiberleif / POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
☆16 · Jun 5, 2019 · Updated 6 years ago
jaumpedro214 / traffic-flow-spark-kafka
Testing Spark Structured Streaming and Kafka with real data from traffic sensors
☆17 · Nov 11, 2022 · Updated 3 years ago
connieya / system_programming
Brain-Stimulating System Programming
☆13 · Mar 2, 2023 · Updated 3 years ago
flyingpeach / sls-code
☆11 · Sep 18, 2025 · Updated 5 months ago
KarlXing / RL-Visual-Continuous-Control
RL Algorithms for Visual Continuous Control
☆36 · May 31, 2023 · Updated 2 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 · Jun 24, 2018 · Updated 7 years ago
hamirashkan / Predictive_DigitalTwin_WindFarm
A feasible digital twin solution prototype for industry 4.0, which will be demonstrated for offshore wind farms through lab-based physica…
☆10 · Jun 6, 2022 · Updated 3 years ago
omerfaruktekin13 / HybridCarModelMATLABSimulinkSimscape
Hybrid Car Model MATLAB Simulink Simscape
☆15 · Jul 27, 2023 · Updated 2 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆32 · Oct 9, 2018 · Updated 7 years ago
cychai1995 / DDPGfD
DDPGfD: This is our implementation project for the Reinforcement Learning course in NCTU.
☆35 · Feb 13, 2022 · Updated 4 years ago
safe-dressing / safe_mpc_rss21
☆11 · Apr 22, 2022 · Updated 3 years ago
YangsZzzi / MFAC
Model-Free Adaptive Control
☆16 · Dec 11, 2020 · Updated 5 years ago
laris / Intel_Atom_D2550_Embedded_Motherboard
Intel Atom D2550 Embedded Motherboard
☆13 · Dec 26, 2018 · Updated 7 years ago
LuoweiZhou / Negotiation-based-MARL
Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"
☆13 · Dec 26, 2017 · Updated 8 years ago
RWTH-EBC / X-HD
☆14 · May 25, 2022 · Updated 3 years ago
ryeii / CLUE
Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.
☆11 · Feb 25, 2025 · Updated last year
ondrejdee / hungarian
Hungarian algorithm for linear sum assignment. Works for square and rectangular matrices.
☆10 · May 16, 2017 · Updated 8 years ago
Stanford-ILIAD / Confidence-Aware-Imitation-Learning
☆40 · Oct 30, 2021 · Updated 4 years ago
istat-methodology / TopicModelingLab
A review of the most popular topic modeling techniques, featuring hands-on tutorials.
☆12 · Apr 29, 2025 · Updated 10 months ago
lasgroup / model-based-meta-rl
☆12 · Oct 15, 2024 · Updated last year
SKSKSK94 / Distributed_SAC
PyTorch implementation of Distributed SAC. Test environments are LunarLanderContinuous-v2 and Metaworld MT1, MT10
☆12 · Apr 6, 2022 · Updated 3 years ago
Philippos01 / mlops-energy-forecast-thesis
Automated pipeline for energy consumption forecasting across Europe using Azure cloud and Databricks.
☆12 · Jul 7, 2023 · Updated 2 years ago
boschresearch / Hydraulic-EoL-Testing
Multivariate Time Series Data usable for Time Series Segmentation and Time Series Classification. Each sample represents the multi-phased…
☆11 · Apr 20, 2024 · Updated last year
MIT-REALM / dcrl
Density Constrained Reinforcement Learning
☆12 · Mar 24, 2023 · Updated 2 years ago
gqgit / IEEE-IV2021-numerically-stable-dynamic-bicycle-model
Numerically stable dynamic bicycle model for discrete time control, accepted by IEEE IV2021 workshop
☆10 · Sep 5, 2022 · Updated 3 years ago
stevenliu216 / ValetParking-MPC
Implementing MPC for low speed parking lot scenarios (EECS561 Final Project)
☆10 · Jul 3, 2019 · Updated 6 years ago
larsvens / tamp_ws
Traction adaptive motion planning using sampling augmented adaptive RTI
☆12 · Jun 6, 2021 · Updated 4 years ago
cap-ntu / iTCM-Datasets
These datasets are collected for the GBIC project to conduct research about indoor human thermal comfort. The GBIC research project propo…
☆12 · Aug 5, 2020 · Updated 5 years ago
jjiantong / FastBO
[CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification
☆11 · Jul 12, 2024 · Updated last year
dkarunakaran / scenario_extraction_framework
☆10 · Feb 18, 2022 · Updated 4 years ago
PreCyseGroup / Data-Driven-ST-MPC
This repository provides codes in MATLAB for computing data-driven backward reachable sets and set-theoretic model predictive control (ST…
☆10 · Aug 2, 2023 · Updated 2 years ago
YooSungHyun / attention-time-forecast
Could attention be used for time-series forecasting?
☆10 · Apr 30, 2021 · Updated 4 years ago
ProSeCo-Planning / ros_proseco_planning
The ROS interface as well as the Python packages for ProSeCo Planning
☆10 · Jun 17, 2024 · Updated last year
resuldagdanov / self-improving-RL
Codebase of paper "Self-Improving Safety Performance of Reinforcement Learning Based Driving with Black-Box Verification Algorithms" publ…
☆12 · Jul 13, 2023 · Updated 2 years ago
haotian-liu / transformers_llava
☆16 · Apr 28, 2023 · Updated 2 years ago
noahbass / highlight-bot
highlight bot is a platform that uses AI to automagically create highlight videos from live FIFA 20 gameplay when goals are scored. It's …
☆11 · Dec 8, 2022 · Updated 3 years ago
sriksmachi / supercabs
Sample for training an agent which mimics a cab driver to gain maximum profits by picking the correct rides. The agent is trained using d…
☆10 · Jul 5, 2022 · Updated 3 years ago
cahaseler / AIQuickFix
Instantly fix problems with ChatGPT AI. Use ChatGPT and GPT-4 AI tools to find one-click 'lightbulb menu' solutions to problems in your c…
☆12 · Mar 26, 2023 · Updated 2 years ago
STOL-AMS / TO-22-Lane-Changing
This project developed a light-duty CAV lane-changing (LC) model with four components: car following (CF), mandatory and incentive-based …
☆12 · Sep 21, 2020 · Updated 5 years ago