An implementation of PPO in Pytorch
☆106 · Jan 7, 2026 · Updated 2 months ago
Alternatives and similar repositories for ppo
Users that are interested in ppo are comparing it to the libraries listed below.
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow ☆20 · Oct 5, 2021 · Updated 4 years ago
- Code for the paper "Phasic Policy Gradient" ☆268 · Apr 2, 2023 · Updated 2 years ago
- ✨🌲 Hierarchical extreme multiclass and multi-label classification. ☆18 · Jan 5, 2023 · Updated 3 years ago
- Hash-routed Networks ☆20 · Nov 20, 2020 · Updated 5 years ago
- 🤖 Creation of an RL environment with Unity, where an agent must learn to survive by moving 🦿 and shooting 🔫, using ML-Agents! ☆19 · Oct 11, 2021 · Updated 4 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN ☆45 · Oct 4, 2020 · Updated 5 years ago
- Axial Positional Embedding for Pytorch ☆84 · Feb 25, 2025 · Updated last year
- Contextual knowledge bases ☆24 · Jun 30, 2022 · Updated 3 years ago
- A multi-agent environment using Unity ML-Agents Toolkit ☆10 · Dec 9, 2020 · Updated 5 years ago
- Tidy up your machine learning experiments ☆17 · Sep 5, 2019 · Updated 6 years ago
- JAX implementations of various deep reinforcement learning algorithms. ☆26 · Feb 2, 2025 · Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆90 · Oct 11, 2024 · Updated last year
- Python package for emotion analysis in French ☆16 · Jun 25, 2021 · Updated 4 years ago
- 4th place solution to datafactory challenge by Intermarché. ☆12 · Jun 28, 2021 · Updated 4 years ago
- Toy environment set for multi-agent reinforcement learning and more ☆39 · Nov 26, 2024 · Updated last year
- Implementation of a holodeck, written in Pytorch ☆18 · Nov 1, 2023 · Updated 2 years ago
- Graph neural network message passing reframed as a Transformer with local attention ☆70 · Dec 24, 2022 · Updated 3 years ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆135 · Oct 15, 2025 · Updated 5 months ago
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data ☆17 · Apr 3, 2025 · Updated 11 months ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch. ☆31 · Sep 10, 2020 · Updated 5 years ago
- Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch ☆67 · Apr 7, 2025 · Updated 11 months ago
- Autoregressive Bayesian linear model ☆21 · Sep 10, 2020 · Updated 5 years ago
- Knowledge Base Embedding By Cooperative Knowledge Distillation ☆67 · Oct 2, 2022 · Updated 3 years ago
- Regard is a self-hosted tool written in Rust and React that tracks the time you spend working on specific projects and displays it using … ☆84 · Jul 13, 2023 · Updated 2 years ago
- Training Agents in a cooperative multi-agent deep reinforcement learning setting to transport objects across a space ☆14 · Jul 5, 2021 · Updated 4 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆55 · May 12, 2025 · Updated 10 months ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion ☆40 · Oct 22, 2022 · Updated 3 years ago
- Deep RL agents with PyTorch ☆36 · Sep 25, 2021 · Updated 4 years ago
- Unofficial PyTorch Implementation of OpenAI's GPT-3 ☆13 · Apr 11, 2022 · Updated 3 years ago
- Reinforcement Learning via Regressing Relative Rewards ☆40 · Dec 12, 2024 · Updated last year
- Implementation of Flash Attention in Jax ☆227 · Mar 1, 2024 · Updated 2 years ago
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP ☆15 · Jun 20, 2025 · Updated 9 months ago
- WIP ☆36 · Jul 29, 2024 · Updated last year
- Clean RL implementation using MLX ☆34 · Mar 8, 2024 · Updated 2 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments ☆39 · Jun 16, 2019 · Updated 6 years ago
- 🎲 Iterable dataset resampling in PyTorch ☆91 · Dec 15, 2021 · Updated 4 years ago
- Code for "Unsupervised Visuomotor Control through Distributional Planning Networks" ☆10 · Jun 27, 2019 · Updated 6 years ago
- Development and evaluation of different approaches for fibre tracking of diffusion weighted MRI data. ☆10 · May 9, 2022 · Updated 3 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts. ☆14 · Jun 22, 2021 · Updated 4 years ago