ArnaudFickinger / adversarial-surprise
Explore and Control with Adversarial Surprise
☆9Updated 3 years ago
Related projects: ⓘ
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆24Updated 2 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆14Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆21Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago
- ☆16Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆21Updated 5 months ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- ☆20Updated 4 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆33Updated last year
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 2 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆14Updated 4 months ago
- ☆15Updated 11 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆13Updated last year
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆18Updated 3 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆14Updated 3 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" ICLR 2022☆13Updated 2 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆17Updated 2 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆11Updated 4 months ago
- ☆14Updated 2 years ago
- ☆33Updated last year
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆29Updated last year
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆21Updated last year
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆32Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆24Updated last year
- ☆20Updated 5 months ago
- Modular Single-file Reinfocement Learning Algorithms Library☆37Updated last year
- ☆28Updated 6 months ago