sail-sg/optim4rl

Optim4RL is a Jax framework of learning to optimize for reinforcement learning.

☆28

Alternatives and similar repositories for optim4rl

Users who are interested in optim4rl are comparing it to the libraries listed below.

qlan3 / Jaxplorer
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
☆13 · Jul 19, 2024 · Updated 2 years ago
sail-sg / rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30 · Jul 18, 2023 · Updated 3 years ago
xtma / apo
Average-Reward Reinforcement Learning with Trust Region Methods
☆11 · Oct 17, 2022 · Updated 3 years ago
sail-sg / hloenv
an environment based on XLA for deep learning compiler optimization research.
☆24 · Mar 7, 2023 · Updated 3 years ago
ethanluoyc / magi
Reinforcement learning library in JAX.
☆102 · Oct 22, 2023 · Updated 2 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83 · May 13, 2024 · Updated 2 years ago
frt03 / inference-based-rl
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)
☆20 · Oct 25, 2021 · Updated 4 years ago
microsoft / segar
Sandbox environment for generalizable agent research
☆27 · Aug 19, 2022 · Updated 3 years ago
google-deepmind / tell_me_why_explanations_rl
☆37 · Apr 27, 2023 · Updated 3 years ago
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13 · Mar 1, 2021 · Updated 5 years ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆116 · Dec 5, 2023 · Updated 2 years ago
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20 · Oct 6, 2021 · Updated 4 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
ademiadeniji / irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆42 · Jan 13, 2024 · Updated 2 years ago
Underflow / reinforcement-2048
A reinforcement learning algorithm for the 2048 game
☆20 · Mar 25, 2014 · Updated 12 years ago
sail-sg / jax_xc
Exchange correlation functionals translated from libxc to jax
☆53 · Mar 24, 2025 · Updated last year
WJ2003B / mqe-release
Official Release of Multistep Quasimetric Estimation (MQE)
☆18 · Mar 13, 2026 · Updated 4 months ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆57 · Apr 26, 2026 · Updated 2 months ago
ucl-dark / skillhack
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17 · Oct 23, 2022 · Updated 3 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆58 · Nov 8, 2022 · Updated 3 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆98 · Updated this week
Asap7772 / understanding-rlhf
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32 · Apr 20, 2024 · Updated 2 years ago
Adeel-Abdullah / mpc-vsi
A model predictive control based voltage source inverter
☆11 · Jan 11, 2020 · Updated 6 years ago
MasterXiong / ModuMorph
Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023
☆15 · Aug 3, 2023 · Updated 2 years ago
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24 · Apr 8, 2024 · Updated 2 years ago
twitter-research / hyperbolic-rl
☆60 · Sep 22, 2022 · Updated 3 years ago
Shen-Lab / LOIS
[NeurIPS 2019] LOIS: Learning to Optimize In Swarms, guided by posterior estimation
☆18 · Aug 14, 2021 · Updated 4 years ago
RobertTLange / gymnax
RL Environments in JAX 🌍
☆910 · Apr 2, 2026 · Updated 3 months ago
instadeepai / sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
☆61 · Oct 23, 2023 · Updated 2 years ago
stacyste / TheoryOfMindInferenceModels
☆28 · Nov 22, 2019 · Updated 6 years ago
astanic / crafter-ood
☆19 · Nov 25, 2022 · Updated 3 years ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆83 · Apr 13, 2023 · Updated 3 years ago
IouJenLiu / HTS-RL
☆21 · Dec 22, 2020 · Updated 5 years ago
mohmdelsayed / HesScale
Scalable Computation of Hessian Diagonals
☆14 · Jun 2, 2024 · Updated 2 years ago
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆348 · Apr 26, 2026 · Updated 2 months ago
danijar / ninjax
General Modules for JAX
☆74 · Apr 7, 2026 · Updated 3 months ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆23 · Dec 22, 2020 · Updated 5 years ago
frt03 / mxt_bench
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)
☆14 · Feb 3, 2023 · Updated 3 years ago
automl / DACBench
A benchmark library for Dynamic Algorithm Configuration.
☆38 · Updated this week