apple / ml-uwacLinks
☆35Updated 3 years ago
Alternatives and similar repositories for ml-uwac
Users that are interested in ml-uwac are comparing it to the libraries listed below
Sorting:
- ☆32Updated 10 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 7 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆26Updated last year
- ☆31Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- ☆48Updated last year
- ☆42Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 4 years ago
- ☆26Updated 2 years ago
- ☆16Updated 3 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- ☆53Updated last year
- on-policy optimization baselines for deep reinforcement learning☆30Updated 5 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆17Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 3 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆25Updated 3 years ago
- ☆56Updated 2 years ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- ☆41Updated 3 years ago
- ☆31Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆24Updated 2 years ago
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆24Updated 2 years ago
- ☆17Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆101Updated last year
- Model-Based Offline Reinforcement Learning☆50Updated 4 years ago