Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
☆20 · Oct 5, 2021 · Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_phasic_policy_gradient
Users that are interested in reinforcement_learning_phasic_policy_gradient are comparing it to the libraries listed below.
- ☆29 · Nov 21, 2022 · Updated 3 years ago
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch ☆22 · Nov 9, 2025 · Updated 5 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration" ☆11 · Oct 22, 2020 · Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets" ☆10 · Nov 22, 2019 · Updated 6 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions ☆13 · May 22, 2023 · Updated 2 years ago
- Reinforcement Learning tool for Network Slice Placement problems ☆28 · May 4, 2024 · Updated 2 years ago
- Model-based reinforcement learning (generative simulator models and planning agents) ☆16 · Mar 13, 2026 · Updated last month
- ☆49 · Apr 22, 2013 · Updated 13 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- Reinforcement Learning Projects ☆19 · Mar 27, 2024 · Updated 2 years ago
- Humanoid behavior imitation using Generative Adversarial Imitation Learning (GAIL) ☆16 · Jul 1, 2020 · Updated 5 years ago
- ☆13 · Oct 2, 2024 · Updated last year
- An implementation of GAIL in PyTorch ☆14 · Mar 11, 2020 · Updated 6 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M… ☆82 · Jan 19, 2019 · Updated 7 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...) ☆17 · Jul 7, 2020 · Updated 5 years ago
- Code for the paper "Phasic Policy Gradient" ☆268 · Apr 2, 2023 · Updated 3 years ago
- Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023) ☆13 · Mar 4, 2024 · Updated 2 years ago
- Curated LLM (ICML 2024) ☆14 · Oct 23, 2024 · Updated last year
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning ☆57 · Apr 3, 2018 · Updated 8 years ago
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance ☆17 · May 1, 2020 · Updated 6 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020 ☆14 · Oct 6, 2020 · Updated 5 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python … ☆20 · Aug 16, 2021 · Updated 4 years ago
- ☆10 · Feb 22, 2023 · Updated 3 years ago
- scalable vnf placement algorithm (laboratory research) ☆28 · Dec 21, 2021 · Updated 4 years ago
- Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020]. ☆16 · Nov 4, 2020 · Updated 5 years ago
- Collection of reinforcement learning algorithms implementations with TensorFlow2 ☆14 · Sep 28, 2024 · Updated last year
- ☆15 · Dec 3, 2024 · Updated last year
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models ☆17 · Oct 22, 2024 · Updated last year
- ☆17 · Mar 2, 2023 · Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 · May 5, 2020 · Updated 6 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration ☆25 · Sep 9, 2019 · Updated 6 years ago
- An implementation of the deep reinforcement learning TD3 algorithm with a prioritized experience replay (PER) buffer ☆25 · Aug 14, 2019 · Updated 6 years ago
- A modular implementation of Proximal Policy Optimization in Tensorflow 2 using Eager Execution for the Super Mario Bros environment. ☆21 · Nov 6, 2019 · Updated 6 years ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo… ☆12 · Feb 23, 2025 · Updated last year
- Proximal Policy Optimization with Stein Control Variates ☆33 · Feb 12, 2018 · Updated 8 years ago
- Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and … ☆15 · Jul 6, 2021 · Updated 4 years ago
- This is MPE-pytorch, with some bugs fixed. ☆11 · Apr 26, 2020 · Updated 6 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML ☆20 · Dec 17, 2023 · Updated 2 years ago