Baichenjia / Contrastive-UCBView external linksLinks
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
☆11Jun 16, 2022Updated 3 years ago
Alternatives and similar repositories for Contrastive-UCB
Users that are interested in Contrastive-UCB are comparing it to the libraries listed below
Sorting:
- ☆13May 21, 2023Updated 2 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Sep 15, 2021Updated 4 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆42Jan 13, 2024Updated 2 years ago
- Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)☆23May 12, 2023Updated 2 years ago
- ☆27Mar 20, 2024Updated last year
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 2 years ago
- Image-based gridworld experiment for learning Markov state abstractions☆21Sep 16, 2024Updated last year
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Apr 6, 2023Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Deep Hierarchical Planning from Pixels☆114Dec 21, 2022Updated 3 years ago
- Implantation of CtrlFormer☆27Oct 17, 2022Updated 3 years ago
- Seoul AI Gym is a toolkit for developing AI algorithms.☆31Dec 15, 2018Updated 7 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Imitation and relaxation reinforcement learning☆29Sep 26, 2022Updated 3 years ago
- ☆30Feb 20, 2021Updated 4 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆30Oct 5, 2022Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Feb 6, 2023Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆38Mar 1, 2021Updated 4 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆35Oct 22, 2020Updated 5 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 2 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Apr 13, 2023Updated 2 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- ☆14Jul 4, 2022Updated 3 years ago
- PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.☆38Jun 22, 2022Updated 3 years ago
- Skype bot written in Java with some X11 magic☆18Mar 16, 2016Updated 9 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Jul 5, 2023Updated 2 years ago
- An unofficial implementation for online decision transformer☆41Sep 20, 2022Updated 3 years ago
- [NeurIPS 2023] Latent Exploration for Reinforcement Learning☆44Feb 23, 2024Updated last year
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- phase-based Observations, Rewards, Coupling Ablation☆49Mar 7, 2024Updated last year
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 2 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago