Proximal Policy Optimization (Continuous Version) in PyTorch.
☆29May 12, 2025Updated 9 months ago
Alternatives and similar repositories for Continuous-PPO
Users who are interested in Continuous-PPO are comparing it to the libraries listed below.
- Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)☆23Jul 16, 2022Updated 3 years ago
- Tuning the PI controller parameters by using a contextual bandit approach☆15Jan 13, 2022Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- My Submission for the OpenAI/NeurIPS ProcGen Competition☆11Nov 12, 2020Updated 5 years ago
- A video game made with PyGame turned into an OpenAI Gym learning environment for reinforcement learning agents.☆15Jan 3, 2023Updated 3 years ago
- Modular Single-file Reinforcement Learning Algorithms Library☆38May 16, 2023Updated 2 years ago
- Dreamer on JAX☆16Jan 19, 2022Updated 4 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.☆37Mar 24, 2023Updated 2 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Code for magnetic mirror descent.☆17Oct 5, 2023Updated 2 years ago
- A Gym env for propulsive rocket landing.☆22Jun 7, 2022Updated 3 years ago
- Web application where humans can play Overcooked with AI agents.☆60Dec 6, 2022Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆106Dec 31, 2021Updated 4 years ago
- Stanford CS234: Reinforcement Learning Winter 2020☆19Mar 24, 2023Updated 2 years ago
- A modular reinforcement learning library with JAX agents☆27Mar 3, 2025Updated last year
- [AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a…☆28Jul 28, 2025Updated 7 months ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Stanford CS234: Reinforcement Learning assignments and practices☆63Jul 31, 2024Updated last year
- The core repository of the elsciRL framework.☆18Dec 8, 2025Updated 2 months ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Distributed Deep Reinforcement Learning☆30Jan 21, 2021Updated 5 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆37Jul 5, 2023Updated 2 years ago
- Multi-agent project (CommNet, BiCNet, MADDPG) in PyTorch for the Multi-Agent Particle Environment☆117Dec 8, 2022Updated 3 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆207Sep 15, 2024Updated last year
- Distributed RL framework for solving the SoulsGym environments☆33Apr 28, 2024Updated last year
- Wolpertinger Training with DDPG (PyTorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆149Nov 15, 2021Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆314Jul 25, 2024Updated last year
- Solutions to the assignments of the Deep Reinforcement Learning course (CS285) presented by the University of California, Berkeley, in PyTorch fra…☆143Nov 21, 2022Updated 3 years ago
- ☆35Nov 22, 2024Updated last year
- 🌿 Quickly generate a folder directory structure, with support for defining the directory depth and for writing the output to a markdown file.☆13Oct 19, 2022Updated 3 years ago
- The dataset was recorded on the Husky robotics platform on the university campus and consists of 5 tracks recorded at different tim…☆11Mar 25, 2025Updated 11 months ago
- 🚀 Train a VLA "large model" yourself, end to end. 🌏 Train a 26M-parameter vision multimodal VLM from scratch in just 1 hour!☆27Oct 16, 2025Updated 4 months ago
- A Texas Hold'em poker framework written in C++20.☆11Apr 23, 2023Updated 2 years ago
- Waterbody style transfer of underwater imagery (JOE 2025)☆26Dec 12, 2025Updated 2 months ago