An implementation of PPO in Pytorch
☆123 · May 17, 2026 · Updated last week
Alternatives and similar repositories for ppo
Users that are interested in ppo are comparing it to the libraries listed below.
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆68 · May 16, 2026 · Updated last week
- Code for the paper "Phasic Policy Gradient" ☆267 · Apr 2, 2023 · Updated 3 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p… ☆20 · Apr 17, 2026 · Updated last month
- ✨🌲 Hierarchical extreme multiclass and multi-label classification. ☆18 · Jan 5, 2023 · Updated 3 years ago
- Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta ☆30 · May 18, 2026 · Updated last week
- Hash-routed Networks ☆20 · Nov 20, 2020 · Updated 5 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆25 · Jan 21, 2025 · Updated last year
- 🤖 Creation of an RL environment with Unity, where an agent must learn to survive by moving 🦿 and shooting🔫, using ML-Agents ! ☆19 · Oct 11, 2021 · Updated 4 years ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆37 · Sep 23, 2024 · Updated last year
- A multi-agent environment using Unity ML-Agents Toolkit ☆10 · Dec 9, 2020 · Updated 5 years ago
- Explorations into NEAT and some of its derivative research ☆37 · Apr 17, 2026 · Updated last month
- Contextual knowledge bases ☆25 · Jun 30, 2022 · Updated 3 years ago
- Axial Positional Embedding for Pytorch ☆84 · Feb 25, 2025 · Updated last year
- JAX implementations of various deep reinforcement learning algorithms. ☆25 · Feb 2, 2025 · Updated last year
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆71 · Apr 21, 2026 · Updated last month
- ☆14 · May 6, 2025 · Updated last year
- Python package for emotion analysis in French ☆16 · Jun 25, 2021 · Updated 4 years ago
- A GPT, made only of MLPs, in Jax ☆59 · Jun 23, 2021 · Updated 4 years ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch ☆154 · May 2, 2025 · Updated last year
- Implementation of Kronecker Attention in Pytorch ☆20 · Sep 12, 2020 · Updated 5 years ago
- Robust Reinforcement Learning Suite ☆37 · Dec 24, 2024 · Updated last year
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- 4th place solution to datafactory challenge by Intermarché. ☆12 · Jun 28, 2021 · Updated 4 years ago
- Toy environment set for multi-agent reinforcement learning and more ☆39 · Nov 26, 2024 · Updated last year
- Implementation of Infini-Transformer in Pytorch ☆112 · Jan 4, 2025 · Updated last year
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · May 31, 2022 · Updated 3 years ago
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo… ☆135 · Jul 6, 2024 · Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch ☆79 · Apr 3, 2026 · Updated last month
- Toy genetic algorithm in Pytorch ☆56 · Apr 21, 2026 · Updated last month
- Official repository for the TMLR paper "Self-Improvement for Neural Combinatorial Optimization: Sample Without Replacement, but Improveme… ☆30 · Jan 22, 2026 · Updated 4 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆16 · Aug 3, 2021 · Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆49 · Jan 27, 2022 · Updated 4 years ago
- Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without New Demonstrations", from USC / Amazon Robotics ☆35 · Aug 15, 2025 · Updated 9 months ago
- Pytorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules ☆58 · Dec 2, 2020 · Updated 5 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Dec 21, 2025 · Updated 5 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆17 · Apr 22, 2025 · Updated last year
- Implementation of Agent Attention in Pytorch ☆93 · Jul 10, 2024 · Updated last year
- 🦀 Online statistics in Rust ☆74 · Jul 31, 2025 · Updated 9 months ago
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch ☆137 · May 5, 2026 · Updated 2 weeks ago