aogara-ds / hoodwinked
Text-based game of lies and deceit, made for language models.
☆32Updated 2 years ago
Alternatives and similar repositories for hoodwinked
Users who are interested in hoodwinked are comparing it to the repositories listed below:
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Updated last year
- Measuring the situational awareness of language models☆39Updated last year
- ☆86Updated 2 years ago
- ☆137Updated 2 years ago
- Memoria is a human-inspired memory architecture for neural networks.☆81Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆174Updated 11 months ago
- ☆48Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆90Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆117Updated 2 years ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335)☆29Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆23Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆62Updated 2 months ago
- ☆100Updated last year
- Lottery Ticket Adaptation☆40Updated last year
- ☆75Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆73Updated last year
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆67Updated last year
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- ☆29Updated last year
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆111Updated last year
- ☆105Updated 11 months ago
- ☆59Updated last year
- Governance of the Commons Simulation (GovSim)☆62Updated 11 months ago
- ☆86Updated last year
- A repository for research on medium sized language models.☆77Updated last year
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"☆132Updated 7 months ago