lukasberglund / reversal_curse
☆259Updated 10 months ago
Related projects: ⓘ
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆436Updated 3 weeks ago
- RewardBench: the first evaluation tool for reward models.☆352Updated last week
- Benchmarking LLMs with Challenging Tasks from Real Users☆182Updated last month
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆423Updated 7 months ago
- The official evaluation suite and dynamic data release for MixEval.☆200Updated last week
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆112Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆405Updated 4 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆130Updated 2 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆290Updated 5 months ago
- Chain-of-Hindsight, A Scalable RLHF Method☆213Updated 11 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆201Updated 10 months ago
- ☆419Updated 2 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆260Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆300Updated 8 months ago
- ☆284Updated 3 months ago
- ☆246Updated 9 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆151Updated 4 months ago
- DSIR large-scale data selection framework for language model training☆221Updated 5 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆209Updated 8 months ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them☆411Updated 2 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆218Updated 5 months ago
- Arena-Hard-Auto: An automatic LLM benchmark.☆421Updated 2 weeks ago
- ☆239Updated 10 months ago
- Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆140Updated 3 months ago
- GPQA: A Graduate-Level Google-Proof Q&A Benchmark☆139Updated 5 months ago
- ☆179Updated last week
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆135Updated last month
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆230Updated 5 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆416Updated 6 months ago
- A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.☆300Updated 11 months ago