nacloos / baba-is-ai
☆23Updated this week
Related projects: ⓘ
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆13Updated 2 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code☆20Updated 2 weeks ago
- ☆11Updated 2 months ago
- ☆56Updated 3 weeks ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆41Updated 3 months ago
- Reinforcement Learning inside a 3D soccer simulation☆19Updated this week
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆75Updated 4 months ago
- Generative cellular automaton-like learning environments for RL.☆19Updated last month
- Contains JAX implementation of algorithms for inverse reinforcement learning☆59Updated last month
- ☆34Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆17Updated 3 weeks ago
- GPT implementation in Flax☆18Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆19Updated 3 months ago
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆34Updated last year
- ☆12Updated 11 months ago
- Repo to reproduce the First-Explore paper results☆36Updated last year
- General Modules for JAX☆57Updated last month
- ☆28Updated 5 months ago
- Scalable Opponent Shaping Experiments in JAX☆19Updated 5 months ago
- ☆17Updated 3 months ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆63Updated 2 weeks ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆20Updated 2 months ago
- Clean RL implementation using MLX☆26Updated 6 months ago
- ☆40Updated 2 months ago
- PyTorch Package For Quasimetric Learning☆38Updated last year
- ☆20Updated last year
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆25Updated 7 months ago
- ☆15Updated 2 years ago
- ☆25Updated this week
- An Open-Ended Agentic Simulator☆17Updated last month