ccnets-team / causal-rl
Causal RL: Reverse-Environment Network Integrated Actor-Critic Algorithm
☆27Updated 3 months ago
Related projects: ⓘ
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆15Updated 2 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆23Updated 2 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆11Updated last month
- ML2-Multi Agent Environments☆33Updated 8 months ago
- Official repo for Offline RL for Online RL☆15Updated 11 months ago
- Yet Another PyTorch Tutorial☆11Updated 3 years ago
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)☆52Updated 3 years ago
- Yet Another Reinforcement Learning Tutorial☆71Updated last year
- Distributed Priortized Experience Replay☆10Updated 6 years ago
- MBRL library in JAX☆9Updated last year
- ☆63Updated 11 months ago
- ☆21Updated 5 months ago
- ☆17Updated last year
- ☆10Updated this week
- Awesome Model-based RL Papers☆9Updated 3 years ago
- ☆30Updated last month
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Updated 8 months ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆26Updated last year
- ☆36Updated last year
- A simple example of randomized ensembled double q learning☆17Updated 3 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆10Updated 2 years ago
- ☆41Updated 5 months ago
- ☆28Updated 6 months ago
- Docker containers of baseline agents for the Crafter environment☆27Updated 2 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆16Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆35Updated 2 weeks ago
- ☆20Updated 4 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆51Updated 5 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆33Updated last week
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆24Updated last year