Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023
☆30 · Jul 18, 2023 · Updated 2 years ago
Alternatives and similar repositories for rosmo
Users that are interested in rosmo are comparing it to the libraries listed below.
- A collection of deep reinforcement learning algorithm implementations ☆11 · Jan 9, 2020 · Updated 6 years ago
- code for the paper Offline Prioritized Experience Replay ☆12 · Jun 13, 2023 · Updated 2 years ago
- ☆16 · Jun 1, 2023 · Updated 2 years ago
- an environment based on XLA for deep learning compiler optimization research. ☆24 · Mar 7, 2023 · Updated 3 years ago
- Reinforcement learning library in JAX. ☆102 · Oct 22, 2023 · Updated 2 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow ☆16 · Oct 18, 2016 · Updated 9 years ago
- Scalable metrics logging and analysis ☆18 · Apr 9, 2025 · Updated last year
- A repo based on XiLin Li's PSGD repo that extends some of the experiments. ☆14 · Oct 7, 2024 · Updated last year
- DiT (training + flow matching) in Jax ☆11 · Jan 5, 2025 · Updated last year
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021 ☆13 · Nov 3, 2021 · Updated 4 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 · Apr 30, 2023 · Updated 3 years ago
- ☆17 · May 1, 2023 · Updated 3 years ago
- ☆18 · Aug 24, 2024 · Updated last year
- (AAAI24 oral) Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO) ☆12 · May 22, 2023 · Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979… ☆15 · Mar 9, 2022 · Updated 4 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection" ☆11 · Aug 7, 2023 · Updated 2 years ago
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag… ☆19 · Jun 30, 2021 · Updated 4 years ago
- ROS OMPL base planner ☆13 · Feb 4, 2016 · Updated 10 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆30 · Apr 8, 2026 · Updated last month
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021) ☆20 · Oct 25, 2021 · Updated 4 years ago
- Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication fo… ☆21 · Apr 23, 2024 · Updated 2 years ago
- Posted at AAAI 2023 ☆11 · Sep 4, 2025 · Updated 8 months ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game ☆21 · Jul 30, 2024 · Updated last year
- Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning". ☆27 · Jul 6, 2023 · Updated 2 years ago
- Fast and reliable distributed systems in Python ☆35 · Mar 25, 2026 · Updated 2 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆70 · Aug 8, 2022 · Updated 3 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆18 · Oct 21, 2022 · Updated 3 years ago
- A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony ☆33 · Oct 16, 2023 · Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆932 · Dec 20, 2023 · Updated 2 years ago
- ☆18 · Jul 10, 2022 · Updated 3 years ago
- Matlab code for "Joint Projection Learning and Tensor Decomposition Based Incomplete Multi-view Clustering". ☆10 · Jun 5, 2023 · Updated 2 years ago
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution ☆61 · Oct 13, 2025 · Updated 7 months ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google) ☆127 · Aug 30, 2024 · Updated last year
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239 ☆22 · Jun 24, 2023 · Updated 2 years ago
- Code for ACM MM 2023 paper - Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning ☆14 · Jan 19, 2024 · Updated 2 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train… ☆11 · Nov 12, 2024 · Updated last year
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games. ☆28 · Nov 19, 2021 · Updated 4 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models" ☆30 · Sep 18, 2025 · Updated 8 months ago
- Game AI explorer ☆16 · Jul 13, 2018 · Updated 7 years ago