aogara-ds / hoodwinked
Text-based game of lies and deceit, made for language models.
☆29Updated last year
Related projects ⓘ
Alternatives and complementary repositories for hoodwinked
- ☆18Updated last month
- ☆78Updated 11 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆93Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- ☆46Updated 2 weeks ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆72Updated 10 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆107Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior.☆106Updated last week
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆83Updated last week
- LLM Agora, debating between open-source LLMs to refine the answers☆52Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆61Updated last week
- ☆72Updated last year
- ☆87Updated 9 months ago
- ☆103Updated last month
- ☆41Updated 2 weeks ago
- ☆90Updated 4 months ago
- Generate High Quality textual or multi-modal datasets with Agents☆17Updated last year
- The official implementation of Self-Exploring Language Models (SELM)☆55Updated 5 months ago
- A repository for research on medium sized language models.☆74Updated 5 months ago
- Measuring the situational awareness of language models☆33Updated 9 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- ☆62Updated 3 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆42Updated 3 months ago
- ☆74Updated 3 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆162Updated last month
- GoldFinch and other hybrid transformer components☆39Updated 4 months ago