google-deepmind / simulation_streams
Simulation Streams is a programming paradigm designed to efficiently control and leverage Large Language Models (LLMs) for complex, dynamic simulations and agentic workflows.
☆17Updated 2 weeks ago
Alternatives and similar repositories for simulation_streams:
Users that are interested in simulation_streams are comparing it to the libraries listed below
- SocialJax: sequential social dilemma environments☆16Updated this week
- An Open-Ended Agentic Simulator☆45Updated 7 months ago
- Simple JAX Graphics Library.☆35Updated 4 months ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆51Updated 2 years ago
- ☆74Updated last week
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Highly scalable 2D JAX physics engine.☆53Updated 3 weeks ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆12Updated 3 weeks ago
- Efficient baselines for autocurricula in JAX.☆186Updated 7 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- A collection of matrix games in JAX☆10Updated 4 months ago
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆46Updated 3 months ago
- ☆70Updated last year
- Learn online intrinsic rewards from LLM feedback☆35Updated 3 months ago
- Simplest and Cleanest DreamerV3 implementation out there☆50Updated last week
- IIG-RL-Benchmark is a library for training and evaluating game theoretical or deep RL algorithms on OpenSpiel games.☆12Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU setting☆150Updated last week
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆22Updated 4 months ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated 11 months ago
- ☆20Updated 9 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated 8 months ago
- ☆13Updated 8 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 5 months ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Benchmarks for Multi-Objective Multi-Agent Decision Making☆85Updated 2 weeks ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆95Updated 5 months ago
- Exploitability calculation for imperfect-information game benchmarks☆23Updated last month
- ☆14Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆71Updated 7 months ago