brownirl / lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆13Updated 2 months ago
Related projects: ⓘ
- Reinforcement Learning inside a 3D soccer simulation☆19Updated this week
- Scalable Opponent Shaping Experiments in JAX☆19Updated 5 months ago
- ☆11Updated 2 months ago
- An Open-Ended Agentic Simulator☆17Updated last month
- ☆25Updated this week
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆13Updated last year
- Docker containers of baseline agents for the Crafter environment☆27Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …