Proximal Policy Optimization (Continuous Version) in PyTorch.
☆27May 12, 2025Updated last year
Alternatives and similar repositories for Continuous-PPO
Users who are interested in Continuous-PPO are comparing it to the libraries listed below.
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆10Nov 12, 2020Updated 5 years ago
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆23Jul 16, 2022Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library☆38May 16, 2023Updated 3 years ago
- The core repository of the elsciRL framework.☆18Dec 8, 2025Updated 6 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago
- Dreamer on JAX☆16Jan 19, 2022Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 3 years ago
- Tuning the PI controller parameters by using a contextual bandit approach☆15Jan 13, 2022Updated 4 years ago
- The Laser Learning Environment (LLE) is a cooperative MARL grid-world☆13Updated this week
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)☆18Apr 15, 2022Updated 4 years ago
- A Gym env for propulsive rocket landing.☆23Jun 7, 2022Updated 4 years ago
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning☆10Nov 14, 2021Updated 4 years ago
- Fork of https://github.com/xbpeng/DeepMimic☆14Sep 10, 2020Updated 5 years ago
- [AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a…☆31Jul 28, 2025Updated 10 months ago
- ☆12Apr 26, 2022Updated 4 years ago
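- This project was created to help students with backgrounds in artificial intelligence, natural language processing, and large language models find jobs; contributions to building and maintaining the project are welcome☆18Mar 30, 2025Updated last year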
- A collection of matrix games in JAX☆14Apr 13, 2026Updated 2 months ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- Distributed Deep Reinforcement Learning☆30Jan 21, 2021Updated 5 years ago
- Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.☆35Apr 11, 2021Updated 5 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 3 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.☆37Mar 24, 2023Updated 3 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).☆12Feb 9, 2025Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Feb 25, 2023Updated 3 years ago
- Code for IEEE Transactions on Neural Networks and Learning Systems☆13Jun 18, 2021Updated 5 years ago
- Code developed for Robothon Challenge 2023☆14Sep 2, 2024Updated last year
- ☆10Jul 28, 2023Updated 2 years ago
- This is a repository of implementations of DQN and its variants in PyTorch, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- Code for magnetic mirror descent.☆19Oct 5, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Value-Decomposition Multi-Agent Actor-Critics☆42Dec 8, 2022Updated 3 years ago
- A project for learning QMIX.☆13Dec 19, 2019Updated 6 years ago
- High-performance tokenized language data-loader for Python C++ extension☆15Jul 22, 2024Updated last year
- ☆36Nov 22, 2024Updated last year
- ☆13Aug 15, 2020Updated 5 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Dec 31, 2021Updated 4 years ago
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago