Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow
☆20Oct 5, 2021Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_phasic_policy_gradient
Users that are interested in reinforcement_learning_phasic_policy_gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆29Nov 21, 2022Updated 3 years ago
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch☆22Nov 9, 2025Updated 7 months ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e…☆57Nov 10, 2025Updated 7 months ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 3 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Unity Chat system including audio chat, video chat and text chat through photon, socket and firebase, however where user can use this plu…☆13Jan 12, 2021Updated 5 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Mar 13, 2026Updated 3 months ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- C++软件开发工程师面试学习笔记☆12Oct 2, 2020Updated 5 years ago
- Humanoid behavior imitation using Generative Adversarial Imitation Learning (GAIL)☆16Jul 1, 2020Updated 5 years ago
- Reinforcement Learning Projects☆19Mar 27, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Vision-based robotic arm grasping using deep reinforcement learning☆28Aug 19, 2023Updated 2 years ago
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC☆11Mar 1, 2021Updated 5 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 3 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Demos of reinforcement learning on Simulation of Urban MObility☆26Nov 23, 2019Updated 6 years ago
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 5 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance☆17May 1, 2020Updated 6 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆19Feb 9, 2021Updated 5 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆20Feb 29, 2020Updated 6 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020].☆16Nov 4, 2020Updated 5 years ago
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆25Jun 25, 2025Updated 11 months ago
- Unofficial implementation of Variational Diffusion Models in PyTorch (Lightning)☆12Aug 31, 2023Updated 2 years ago
- ☆15Dec 3, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Mar 2, 2023Updated 3 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer☆25Aug 14, 2019Updated 6 years ago
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆39May 20, 2026Updated 3 weeks ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆18Nov 14, 2023Updated 2 years ago