sukhijab / maxinforl_torch
☆34Updated last month
Alternatives and similar repositories for maxinforl_torch:
Users that are interested in maxinforl_torch are comparing it to the libraries listed below
- PWM: Policy Learning with Large World Models☆39Updated 4 months ago
- ☆29Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆25Updated 3 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆61Updated 9 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆40Updated last month
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆12Updated 7 months ago
- Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation☆63Updated last year
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆44Updated this week
- ☆14Updated last month
- ☆18Updated last month
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆59Updated last year
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆27Updated last month
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- ☆21Updated 9 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆26Updated 2 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆34Updated 10 months ago
- ☆41Updated 5 months ago
- Jax/Flax Implementation of TD-MPC2☆51Updated this week
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆63Updated 7 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 9 months ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆9Updated last year
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated 11 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆76Updated 9 months ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆26Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆67Updated last year
- JAX implementation of WSRL and RL baselines☆18Updated last week
- Code for Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations☆65Updated last year
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆29Updated 2 years ago
- speed-running solving robot manipulation tasks☆20Updated 2 months ago