lukasberglund / reversal_curseLinks
☆300Updated 2 years ago
Alternatives and similar repositories for reversal_curse
Users that are interested in reversal_curse are comparing it to the libraries listed below
Sorting:
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆303Updated 11 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆535Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆132Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆245Updated last year
- RuLES: a benchmark for evaluating rule-following in language models☆246Updated 10 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆165Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆119Updated 2 years ago
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆160Updated last year
- ☆139Updated last year
- ☆85Updated 11 months ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Self-Alignment with Principle-Following Reward Models☆169Updated 4 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆118Updated 2 years ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆100Updated 2 years ago
- ☆150Updated 2 years ago
- Evaluating LLMs with fewer examples☆169Updated last year
- Scaling Data-Constrained Language Models☆342Updated 6 months ago
- ☆249Updated 3 years ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆124Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆226Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆71Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆222Updated last month
- ☆202Updated 9 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆218Updated 2 years ago
- Function Vectors in Large Language Models (ICLR 2024)☆190Updated 9 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆196Updated 11 months ago
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆90Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆253Updated 2 years ago
- A simple unified framework for evaluating LLMs☆258Updated 9 months ago