aogara-ds / hoodwinked
Text-based game of lies and deceit, made for language models.
☆31Updated last year
Alternatives and similar repositories for hoodwinked
Users that are interested in hoodwinked are comparing it to the libraries listed below
Sorting:
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆75Updated last year
- Measuring the situational awareness of language models☆34Updated last year
- Functional Benchmarks and the Reasoning Gap☆86Updated 7 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated 11 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 4 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆105Updated last year
- ☆82Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 7 months ago
- ☆68Updated 3 months ago
- ☆48Updated 6 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆37Updated 2 weeks ago
- ☆132Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Governance of the Commons Simulation (GovSim)☆47Updated 4 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆111Updated last year
- LLM Agora, debating between open-source LLMs to refine the answers☆65Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆64Updated 11 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- ☆97Updated 10 months ago
- ☆45Updated last year
- entropix style sampling + GUI☆26Updated 6 months ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆52Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆44Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆70Updated 11 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆102Updated 9 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆80Updated last year