sail-sg/rosmo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sail-sg/rosmo)

sail-sg / rosmo

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

☆30

Alternatives and similar repositories for rosmo

Users that are interested in rosmo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hwhitetooth / jax_muzero
View on GitHub
An implementation of MuZero in JAX.
☆58Nov 8, 2022Updated 3 years ago
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
sail-sg / hloenv
View on GitHub
an environment based on XLA for deep learning compiler optimization research.
☆24Mar 7, 2023Updated 3 years ago
sail-sg / offbench
View on GitHub
☆16Jun 1, 2023Updated 3 years ago
ethanluoyc / magi
View on GitHub
Reinforcement learning library in JAX.
☆102Oct 22, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tilarids / reinforcement_learning_playground
View on GitHub
Playground for reinforcement learning algorithms implemented in TensorFlow
☆16Oct 18, 2016Updated 9 years ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
kelechi-c / dit_flow
View on GitHub
DiT (training + flow matching) in Jax
☆12Jan 5, 2025Updated last year
fidel-schaposnik / muzero
View on GitHub
Tensorflow implementation of MuZero algorithm
☆11Aug 23, 2022Updated 3 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
View on GitHub
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13Nov 3, 2021Updated 4 years ago
YeWR / FICC
View on GitHub
☆17May 1, 2023Updated 3 years ago
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
procopiostein / ompl_planner_base
View on GitHub
ROS OMPL base planner
☆14Feb 4, 2016Updated 10 years ago
Jackory / RPBT
View on GitHub
(AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)
☆12May 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GallagherCommaJack / modulax
View on GitHub
☆18Aug 24, 2024Updated last year
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
LucasAlegre / mbcd
View on GitHub
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆11Aug 7, 2023Updated 2 years ago
gdikov / adversarial-variational-bayes
View on GitHub
A Keras/TensorFlow-based implementation of Adversarial Variational Bayes by L. Mescheder et al.
☆11Jul 1, 2017Updated 9 years ago
dlwh / jax_sourceror
View on GitHub
Turn jitted jax functions back into python source code
☆23Dec 16, 2024Updated last year
danijar / scope
View on GitHub
Scalable metrics logging and analysis
☆18Jun 30, 2026Updated 3 weeks ago
IQ250 / LeNet-by-Numpy
View on GitHub
The LeNet is realized by Numpy but not by TensorFlow, MXNet, Caffe or other tools. This work is for fundamental research like optimizatio…
☆12Feb 22, 2019Updated 7 years ago
frt03 / inference-based-rl
View on GitHub
Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)
☆20Oct 25, 2021Updated 4 years ago
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zchuning / repo
View on GitHub
Resilient Model-Based RL by Regularizing Posterior Predictability
☆22Mar 4, 2024Updated 2 years ago
raincchio / P3O
View on GitHub
Posted at AAAI 2023
☆11Sep 4, 2025Updated 10 months ago
sisl / BetaZero.jl
View on GitHub
Belief-state planning for POMDPs using learned approximations
☆25Jan 21, 2025Updated last year
daviddaytw / MIDI-MUG
View on GitHub
Play music game with MIDI keyboard!
☆15Sep 18, 2024Updated last year
Shengjiewang-Jason / EfficientZeroV2
View on GitHub
[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
☆120Aug 9, 2024Updated last year
CORE-Robotics-Lab / HetNet
View on GitHub
Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication fo…
☆21Jun 14, 2026Updated last month
thu-ml / CEURL
View on GitHub
Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)
☆19Oct 13, 2024Updated last year
facebookresearch / ede
View on GitHub
Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".
☆28Jul 6, 2023Updated 3 years ago
zhaoyi11 / dreamer-pytorch
View on GitHub
A pytorch implementation of Dreamer
☆25Mar 13, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jungm2018 / communications_neural_net
View on GitHub
Implementation of Neural Nets for Communications Channel Decoding using Log Likelihood Ratios
☆16Nov 19, 2020Updated 5 years ago
apexrl / COIL
View on GitHub
Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"
☆18Oct 21, 2022Updated 3 years ago
SS47816 / fiss_plus_planner
View on GitHub
[IROS 2023] FISS+: Efficient and Focused Trajectory Generation and Refinement using Fast Iterative Search and Sampling Strategy
☆59Mar 21, 2024Updated 2 years ago
yukara-ikemiya / minimal-sqvae
View on GitHub
A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony
☆33Oct 16, 2023Updated 2 years ago
avillaflor / SPLT-transformer
View on GitHub
☆18Jul 10, 2022Updated 4 years ago
YeWR / EfficientZero
View on GitHub
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
☆939Dec 20, 2023Updated 2 years ago
frt03 / generalized_dt
View on GitHub
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆70Aug 8, 2022Updated 3 years ago