Chris-hughes10 / simple-ppoLinks
A clean, modular implementation of the Proximal Policy Optimization (PPO) algorithm in PyTorch, written with a strong focus on readability and educational value, as well as performance.
☆18Updated last year
Alternatives and similar repositories for simple-ppo
Users that are interested in simple-ppo are comparing it to the libraries listed below
Sorting:
- A batched implementation for efficient Qwen2.5-VL inference.☆19Updated 3 months ago
- An implementation of PPO in Pytorch☆97Updated this week
- code associated with paper "Sparse Bayesian Optimization"☆26Updated 2 years ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆271Updated 7 months ago
- A Torch Based RL Framework for Rapid Prototyping of Research Papers☆70Updated 3 months ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆38Updated last week
- Gradient Boosting Reinforcement Learning (GBRL)☆122Updated this week
- 💻 As a Frontend Development Intern at Shen AI (Aug – Oct 2024), I built the company website using React.js and worked with the design te…☆14Updated 5 months ago
- On-Policy Policy Gradient Algorithms in JAX☆40Updated last year
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆60Updated 8 months ago
- A parallel ODE solver for PyTorch☆270Updated last year
- TorchOpt is an efficient library for differentiable optimization built upon PyTorch.☆616Updated this week
- Explorations into the recently proposed Taylor Series Linear Attention☆99Updated last year
- 📈Implementing the ADAM optimizer from the ground up with PyTorch and comparing its performance on six 3-D objective functions (each prog…☆21Updated 3 years ago
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆288Updated 7 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Updated 2 years ago
- Online Decision Transformer☆272Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆34Updated 4 months ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆141Updated 6 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆107Updated 2 weeks ago
- Collect optimizer related papers, data, repositories☆99Updated 11 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- fast + parallel AlphaZero in JAX☆104Updated 10 months ago
- Reinforcement Learning Algorithms Tutorial (Python) from scratch (Mar 2021)☆201Updated this week
- Deep Q Networks☆88Updated 7 years ago
- Fast reinforcement learning 💨☆28Updated 3 months ago
- A State-Space Model with Rational Transfer Function Representation.☆82Updated last year
- ☆73Updated last year
- Explorations into improving ViTArc with Slot Attention☆43Updated last year