lukasberglund / reversal_curseLinks

☆291

Alternatives and similar repositories for reversal_curse

Users that are interested in reversal_curse are comparing it to the libraries listed below

Sorting:

veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆162Updated last year
kmeng01 / memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆510Updated last year
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆122Updated 8 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆233Updated 9 months ago
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆228Updated 5 months ago
princeton-nlp / intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
☆223Updated last year
meg-tong / sycophancy-eval
datasets from the paper "Towards Understanding Sycophancy in Language Models"
☆86Updated last year
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆289Updated 5 months ago
anthropics / ConstitutionalHarmlessnessPaper
☆240Updated 2 years ago
wenhuchen / TheoremQA
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
☆159Updated last year
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆208Updated 3 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago
da03 / implicit_chain_of_thought
☆135Updated 8 months ago
neulab / gemini-benchmark
☆149Updated last year
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
SALT-NLP / demonstrated-feedback
☆125Updated 10 months ago
likenneth / honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆540Updated 6 months ago
allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆223Updated 8 months ago
kaistAI / CoT-Collection
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
☆245Updated last year
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆207Updated last month
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated last year
da03 / Internalize_CoT_Step_by_Step
☆187Updated 3 months ago
shengliu66 / ICV
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
☆182Updated 5 months ago
ezelikman / STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
☆206Updated 2 years ago
protagolabs / odyssey-math
☆84Updated 6 months ago
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆162Updated 2 months ago
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116Updated 2 years ago
kaistAI / FLASK
[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
☆218Updated last year
p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆257Updated last year
WildEval / ZeroEval
A simple unified framework for evaluating LLMs
☆235Updated 3 months ago