stephmilani / multiagent-viper
☆10Updated 3 years ago
Alternatives and similar repositories for multiagent-viper
Users who are interested in multiagent-viper are comparing it to the repositories listed below
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Updated 4 years ago
- solver for discrete Mixed Observable Markov Decision Processes☆11Updated 5 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Updated 4 years ago
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆13Updated 2 years ago
- ☆11Updated 3 weeks ago
- ☆11Updated 4 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 4 years ago
- Public implementation of "Encoding Human Domain Knowledge to Warm Start Reinforcement Learning" from AAAI'21☆20Updated last year
- MBRL library in JAX☆12Updated 3 years ago
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]☆13Updated 3 years ago
- ☆32Updated 4 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning☆14Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Updated 6 months ago
- ☆16Updated 4 years ago
- Implementation of "POPCORN: Partially Observed Prediction Constrained Reinforcement Learning" (Futoma, Hughes, Doshi-Velez, AISTATS 2020)☆11Updated 4 years ago
- Representation Learning in RL☆13Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆14Updated last year
- on-policy optimization baselines for deep reinforcement learning☆32Updated 5 years ago
- ☆17Updated 3 years ago
- ☆19Updated 2 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Updated 3 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆20Updated 7 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Updated 3 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Updated 4 years ago
- Public code for implementation and experiments with differentiable decision trees.☆32Updated last year
- ☆13Updated last year
- ☆32Updated 4 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Updated 3 years ago