understanding-search / maze-datasetLinks
maze datasets for investigating OOD behavior of ML systems
☆60Updated this week
Alternatives and similar repositories for maze-dataset
Users that are interested in maze-dataset are comparing it to the libraries listed below
Sorting:
- ☆103Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆175Updated 4 months ago
- Rewarded soups official implementation☆60Updated 2 years ago
- ☆106Updated 8 months ago
- Paper collections of the continuous effort start from World Models.☆185Updated last year
- Reinforcement Learning via Regressing Relative Rewards☆36Updated 10 months ago
- Code for Contrastive Preference Learning (CPL)☆176Updated 10 months ago
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆39Updated 11 months ago
- ☆73Updated last year
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆31Updated last year
- ☆131Updated last year
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆45Updated 11 months ago
- ☆54Updated 11 months ago
- Bootstrapping ARC☆142Updated 10 months ago
- Official release of the benchmark in paper "VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for…☆13Updated 2 months ago
- ☆63Updated 7 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆121Updated 6 months ago
- A library for efficient patching and automatic circuit discovery.☆77Updated 2 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆180Updated 6 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆23Updated last year
- ☆34Updated 7 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆141Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆123Updated last year
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆102Updated 2 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆201Updated last month
- ☆186Updated last year
- ☆108Updated 4 months ago
- ☆85Updated last year
- ☆69Updated 11 months ago