ArnaudFickinger / adversarial-surpriseLinks
Explore and Control with Adversarial Surprise
☆10Updated 3 years ago
Alternatives and similar repositories for adversarial-surprise
Users that are interested in adversarial-surprise are comparing it to the libraries listed below
Sorting:
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 4 years ago
- ☆18Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆39Updated 7 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆39Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆23Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆26Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 3 years ago
- ☆31Updated last year
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆16Updated last year
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆13Updated last year
- ☆19Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆17Updated 3 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Action Value Gradient Algorithm☆21Updated last month
- ☆22Updated last year
- ☆35Updated 3 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆18Updated 4 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 6 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆49Updated last year
- My Body Is A Cage☆41Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year