RyanNavillus/reward-surfaces

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RyanNavillus/reward-surfaces)

RyanNavillus / reward-surfaces

☆19

Alternatives and similar repositories for reward-surfaces

Users that are interested in reward-surfaces are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
RajGhugare19 / stitching-is-combinatorial-generalisation
View on GitHub
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25Apr 19, 2024Updated 2 years ago
luchris429 / JaxLife
View on GitHub
An Open-Ended Agentic Simulator
☆61Aug 11, 2024Updated last year
bryanoliveira / sliding-puzzles-gym
View on GitHub
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17Jun 23, 2025Updated last year
huterguier / lox
View on GitHub
Logging library for JAX that is compatible with transformations and primitives such as vmap and scan.
☆16Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
yun-kwak / decision-transformer-jax
View on GitHub
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13Aug 14, 2024Updated last year
hsp-iit / fast-ycb
View on GitHub
The Fast-YCB Dataset
☆18Nov 2, 2023Updated 2 years ago
Steven-Ho / VALOR
View on GitHub
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10Jun 28, 2019Updated 7 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
View on GitHub
A2C is a special case of PPO!
☆23May 20, 2022Updated 4 years ago
Artur-Galstyan / jaxonloader
View on GitHub
A dataloader, but for JAX
☆20May 17, 2024Updated 2 years ago
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
DramaCow / jaxued
View on GitHub
☆98Jan 21, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
clvrai / new-actions-rl
View on GitHub
☆24Aug 9, 2024Updated last year
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
MichaelTMatthews / purejaxgcrl
View on GitHub
GCRL in JAX. Official repository for LEO (ICML 2026).
☆28Jun 20, 2026Updated last month
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago
hari-sikchi / DVL
View on GitHub
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16Oct 22, 2023Updated 2 years ago
entity-neural-network / entity-gym
View on GitHub
Standard interface for entity based reinforcement learning environments.
☆39Feb 28, 2024Updated 2 years ago
omron-sinicx / ShinRL
View on GitHub
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021)
☆50Feb 15, 2022Updated 4 years ago
corl-team / ad-eps
View on GitHub
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35Sep 18, 2024Updated last year
RyanNavillus / PPO-v3
View on GitHub
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16May 19, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
riiswa / pointax
View on GitHub
Pointax: PointMaze Environment for JAX
☆28Oct 22, 2025Updated 9 months ago
automl / arlbench
View on GitHub
HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient
☆32Jun 16, 2026Updated last month
eilab-gt / NovGrid
View on GitHub
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆34May 21, 2024Updated 2 years ago
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆73Apr 26, 2026Updated 2 months ago
clvrai / create
View on GitHub
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18Nov 22, 2022Updated 3 years ago
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
subho406 / Recurrent-PPO-Jax
View on GitHub
Implementation of Proximal Policy Optimization in Jax+Flax
☆21May 18, 2023Updated 3 years ago
RPegoud / jym
View on GitHub
JAX implementation of RL algorithms and vectorized environments
☆50Dec 26, 2023Updated 2 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
seohongpark / LSD
View on GitHub
Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)
☆39Jun 3, 2023Updated 3 years ago
MMintLab / VIRDO
View on GitHub
Github repository of a Visio-tactile Implicit Representations of Deformable Objects (ICRA 2022)
☆27Nov 1, 2023Updated 2 years ago
facebookresearch / bipedal-skills
View on GitHub
Bipedal Skills Benchmark for Reinforcement Learning
☆26Oct 27, 2022Updated 3 years ago
dunnolab / harmony
View on GitHub
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15Jun 30, 2026Updated 3 weeks ago
dunnolab / laom
View on GitHub
Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025
☆38Jul 8, 2025Updated last year
zdhNarsil / Stochastic-Marginal-Actor-Critic
View on GitHub
Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".
☆24Feb 9, 2023Updated 3 years ago