nicklashansen / svea-vit
Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"
☆17Updated last year
Alternatives and similar repositories for svea-vit:
Users that are interested in svea-vit are comparing it to the libraries listed below
- ☆47Updated last year
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆23Updated 3 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- ☆26Updated last year
- Code for Policy Consolidation for Continual Reinforcement Learning☆10Updated 5 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆26Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆16Updated 3 years ago
- ☆39Updated 3 years ago
- ☆53Updated last year
- ☆54Updated 11 months ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Updated 2 years ago
- ☆17Updated 3 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- ☆14Updated last year
- ☆46Updated 2 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆31Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆98Updated 8 months ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated 2 years ago
- Change-Based Exploration Transfer☆36Updated 2 years ago
- ☆31Updated 3 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆17Updated 2 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆19Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆34Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- ☆42Updated 3 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 9 months ago