google / werewolf_arena
☆13 Updated 8 months ago
Alternatives and similar repositories for werewolf_arena:
Users who are interested in werewolf_arena are comparing it to the repositories listed below.
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- Minimum Description Length probing for neural network representations ☆19 Updated 2 months ago
- Training hybrid models for dummies. ☆20 Updated 2 months ago
- ☆12 Updated 7 months ago
- ☆15 Updated 6 months ago
- 🧮 Algebraic Positional Encodings. ☆11 Updated 2 months ago
- ☆15 Updated last year
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year
- ☆14 Updated last year
- ☆9 Updated 3 weeks ago
- Learn online intrinsic rewards from LLM feedback ☆35 Updated 3 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated 4 months ago
- Implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆15 Updated last year
- microjax: a micro, JAX-like function transformation engine ☆30 Updated 5 months ago
- Repo to reproduce the First-Explore paper results ☆37 Updated 3 months ago
- this is for fun, ain't it grand! ☆14 Updated 11 months ago
- ☆18 Updated 11 months ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆15 Updated last week
- Repository for "Toward Artificial Open-Ended Evolution within Lenia using Quality-Diversity" (ALIFE 2024). ☆20 Updated 8 months ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22). ☆29 Updated 2 years ago
- CycleQD is a framework for parameter space model merging. ☆35 Updated last month
- ☆25 Updated 9 months ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices. ☆15 Updated last week
- Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024) ☆17 Updated 9 months ago
- Clean RL implementation using MLX ☆28 Updated last year
- ☆31 Updated 2 years ago
- Collection of LLM completions for reasoning-gym task datasets ☆15 Updated this week