A clean, modular implementation of the Proximal Policy Optimization (PPO) algorithm in PyTorch, written with a strong focus on readability and educational value, as well as performance.
☆19Feb 7, 2026Updated last month
Alternatives and similar repositories for simple-ppo
Users that are interested in simple-ppo are comparing it to the libraries listed below
Sorting:
- Companion code release to "Bayesian Optimization of Function Networks", published in NeurIPS 2021.☆11Jan 12, 2025Updated last year
- Official repository of "Minibatch optimal transport distances; analysis and applications" (https://arxiv.org/pdf/2101.01792.pdf)☆10Oct 15, 2021Updated 4 years ago
- Commonsense Scene Graph-based Target Localization for Object Search☆15Apr 2, 2024Updated last year
- https://interactivetraining.ai/☆17Oct 2, 2025Updated 5 months ago
- The Automata Learning Framework☆19May 20, 2020Updated 5 years ago
- [ICLR 2021] Few Shot Bayesian Optimization☆22Oct 17, 2022Updated 3 years ago
- Automatic Metric for Evaluating Generated Videos☆34Dec 8, 2025Updated 3 months ago
- Community Implementation of *Temporal Latent Auto-Encoder* as described in [Temporal Latent Auto-Encoder: A Method for Probabilistic Mult…☆15Jun 9, 2022Updated 3 years ago
- ☆24Jul 17, 2024Updated last year
- 基于ChatGLM2带的openai_api.py修改支持ChatGLM3。☆19Oct 31, 2023Updated 2 years ago
- Dota 2 replay knowledge in book form.☆27Apr 30, 2014Updated 11 years ago
- The official implementation of PFNs4BO: In-Context Learning for Bayesian Optimization☆40Sep 18, 2025Updated 6 months ago
- Implementation of "Single-pass stratified importance resampling"☆30Jul 30, 2022Updated 3 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- ☆26Jan 3, 2025Updated last year
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆86Nov 27, 2023Updated 2 years ago
- Implementation of VAE and Style-GAN Architecture Achieving State of the Art Reconstruction☆29Mar 24, 2023Updated 2 years ago
- A Golang client for FalkorDB☆17Updated this week
- Training DIAMOND to play MarioKart64 in a Neural Network.☆30Sep 9, 2025Updated 6 months ago
- learning notes when learning the source code of pytorch☆24Apr 3, 2019Updated 6 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- CS4246 course summaries☆20Nov 11, 2018Updated 7 years ago
- Reinforcement learning algorithms A2C, A3C and DQN☆18Oct 3, 2023Updated 2 years ago
- A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal, but extensible training loop …☆193Feb 27, 2026Updated 3 weeks ago
- ☆11Feb 9, 2024Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- GRPC client CLI, like grpcurl, but in Rust; GRPC Client UI, like postman, but in Rust☆21Jan 29, 2026Updated last month
- Tracking the latest and greatest research papers on text-to-image generation.☆62Updated this week
- A dataset for multi-object multi-actor activity parsing☆41Sep 29, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 3 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago
- Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation. CVPR 2022☆36Oct 27, 2022Updated 3 years ago
- Example implementation of Zeebe workflows using pyzeebe.☆12Jun 1, 2021Updated 4 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- Perf monitoring CLI tool for Apple Silicon☆10Jan 25, 2023Updated 3 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Jun 11, 2024Updated last year
- LLM Proxy☆12Aug 26, 2024Updated last year
- ☆50Jun 7, 2025Updated 9 months ago