Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
☆20Oct 5, 2021Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_phasic_policy_gradient
Users that are interested in reinforcement_learning_phasic_policy_gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch☆22Nov 9, 2025Updated 6 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 6 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 3 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- C++软件开发工程师面试学习笔记☆12Oct 2, 2020Updated 5 years ago
- This project explores deep reinforcement learning, hybrid actor-critic approach with A3C/PPO combined with curiosity for playing Super M…☆82Jan 19, 2019Updated 7 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023)☆13Mar 4, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance☆17May 1, 2020Updated 6 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆19Feb 9, 2021Updated 5 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆20Feb 29, 2020Updated 6 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- Synchronous memory pipe for Rust☆33Nov 28, 2020Updated 5 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- scalable vnf placement algorithm (laboratory research)☆28Dec 21, 2021Updated 4 years ago
- Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020].☆16Nov 4, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Collection of reinforcement learning algorithms implementations with TensorFlow2☆14Sep 28, 2024Updated last year
- ☆15Dec 3, 2024Updated last year
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- ☆17Mar 2, 2023Updated 3 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 6 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆25Aug 14, 2019Updated 6 years ago
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆38May 20, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Mar 14, 2016Updated 10 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Dec 30, 2019Updated 6 years ago
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆11Jan 27, 2025Updated last year
- Solving Physics Puzzles by Reasoning about Paths (NeurIPS 2020 workshop)☆14Jun 28, 2022Updated 3 years ago
- a recommendation list of math courses for people with no math background.☆11Mar 2, 2021Updated 5 years ago
- Code accompanying paper, Forward Prediction for Physical Reasoning☆11Oct 12, 2021Updated 4 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago