Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
☆20Oct 5, 2021Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_phasic_policy_gradient
Users who are interested in reinforcement_learning_phasic_policy_gradient are comparing it to the libraries listed below.
- An implementation of PPO in Pytorch☆106Jan 7, 2026Updated 2 months ago
- ☆30Nov 21, 2022Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆55Nov 10, 2025Updated 4 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Unity Chat system including audio chat, video chat and text chat through photon, socket and firebase, however where user can use this plu…☆12Jan 12, 2021Updated 5 years ago
- Reinforcement Learning tool for Network Slice Placement problems☆28May 4, 2024Updated last year
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration☆17Jan 7, 2021Updated 5 years ago
- Humanoid behavior imitation using Generative Adversarial Imitation Learning (GAIL)☆16Jul 1, 2020Updated 5 years ago
- Joint placement and scaling of bidirectional network services with stateful virtual or physical network functions☆30Jun 29, 2020Updated 5 years ago
- Graph Neural Network Course: Graph Attention Networks☆11Dec 28, 2019Updated 6 years ago
- The implement of GAIL with pytorch☆14Mar 11, 2020Updated 6 years ago
- Code for the paper "Phasic Policy Gradient"☆268Apr 2, 2023Updated 2 years ago
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 4 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 7 years ago
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance☆17May 1, 2020Updated 5 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Synchronous memory pipe for Rust☆31Nov 28, 2020Updated 5 years ago
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation☆37Nov 17, 2020Updated 5 years ago
- scalable vnf placement algorithm (laboratory research)☆28Dec 21, 2021Updated 4 years ago
- Collection of reinforcement learning algorithms implementations with TensorFlow2☆14Sep 28, 2024Updated last year
- ☆15Dec 3, 2024Updated last year
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- Traffic network flow prediction based on the Graph Attention Network (GAT) model☆16Apr 16, 2022Updated 3 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- ☆17Mar 2, 2023Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆24Aug 14, 2019Updated 6 years ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and …☆16Jul 6, 2021Updated 4 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Dec 17, 2023Updated 2 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Dec 30, 2019Updated 6 years ago
- [ICC '21 - DRL-SFCP] Implementation of our paper "DRL-SFCP: Adaptive Service Function Chains Placement with Deep Reinforcement Learning",…☆45May 10, 2025Updated 10 months ago