Chris-hughes10 / simple-ppoLinks
A clean, modular implementation of the Proximal Policy Optimization (PPO) algorithm in PyTorch, written with a strong focus on readability and educational value, as well as performance.
☆19Updated last year
Alternatives and similar repositories for simple-ppo
Users that are interested in simple-ppo are comparing it to the libraries listed below
Sorting:
- Implementation of Agent Attention in Pytorch☆93Updated last year
- ☆63Updated 2 months ago
- High-performance CUDA kernels for real-time financial low latency inference, optimized for both consumer and datacenter GPUs.☆19Updated 4 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆121Updated this week
- An implementation of PPO in Pytorch☆101Updated 3 weeks ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆120Updated last month
- The Packing problem has gained much relevance with the recent upheaval of the delivery and retail industry. Companies all over the world …☆11Updated 4 years ago
- Reinforcement Learning Algorithms Tutorial (Python) from scratch (Mar 2021)☆203Updated this week
- Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation☆356Updated 3 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆60Updated this week
- On-Policy Policy Gradient Algorithms in JAX☆41Updated last year
- ☆35Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆274Updated 9 months ago
- Large Language Model Evolutionary Algorithm☆84Updated last week
- ☆34Updated last year
- code associated with paper "Sparse Bayesian Optimization"☆26Updated 2 years ago
- ☆23Updated 10 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆130Updated last month
- FlashRNN - Fast RNN Kernels with I/O Awareness☆173Updated 2 months ago
- Code for [ICML2025]``Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design``.☆67Updated 6 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆34Updated 5 months ago
- Implementation of Diffusion Transformer Model in Pytorch☆71Updated 7 months ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- RND1: Scaling Diffusion Language Models☆166Updated 3 weeks ago
- A State-Space Model with Rational Transfer Function Representation.☆83Updated last year
- ☆55Updated 10 months ago
- Diffusion model derived evolutionary algorithm☆238Updated 6 months ago
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆292Updated 8 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆149Updated 9 months ago