lukasberglund / reversal_curse
☆306 · Nov 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for reversal_curse
Users who are interested in reversal_curse are comparing it to the repositories listed below
- Measuring the situational awareness of language models ☆40 · Feb 12, 2024 · Updated 2 years ago
- ☆12 · Apr 24, 2024 · Updated last year
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse' ☆13 · Aug 2, 2024 · Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆316 · Dec 20, 2023 · Updated 2 years ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?" ☆31 · Jun 18, 2025 · Updated 7 months ago
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- Scaling Data-Constrained Language Models ☆340 · Jun 28, 2025 · Updated 7 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆121 · Aug 16, 2023 · Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap ☆89 · Oct 1, 2024 · Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆71 · Jun 19, 2024 · Updated last year
- Teaching Models to Express Their Uncertainty in Words ☆39 · May 26, 2022 · Updated 3 years ago
- ☆43 · Sep 3, 2024 · Updated last year
- CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022) ☆11 · Dec 7, 2022 · Updated 3 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆16 · Aug 6, 2024 · Updated last year
- ☆37 · May 28, 2023 · Updated 2 years ago
- The Effect of Sampling Temperature on Problem Solving in Large Language Models ☆24 · Nov 25, 2024 · Updated last year
- ☆16 · Mar 22, 2025 · Updated 10 months ago
- Placeholder for code of BSP. ☆11 · Aug 13, 2021 · Updated 4 years ago
- ☆30 · Mar 11, 2025 · Updated 11 months ago
- ☆160 · Nov 23, 2024 · Updated last year
- Reversal Curse Experiment ☆15 · Sep 24, 2023 · Updated 2 years ago
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆667 · Nov 20, 2023 · Updated 2 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆101 · Oct 19, 2023 · Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 · Sep 10, 2024 · Updated last year
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆13 · Jul 4, 2024 · Updated last year
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio… ☆15 · Mar 16, 2024 · Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 · May 26, 2025 · Updated 8 months ago
- Salesforce open-source LLMs with 8k sequence length. ☆724 · Jan 31, 2025 · Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆506 · Apr 24, 2025 · Updated 9 months ago
- ☆42 · Sep 19, 2024 · Updated last year
- PROSE Public Benchmark Suite ☆31 · Sep 15, 2025 · Updated 5 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,814 · Jun 17, 2025 · Updated 8 months ago
- Locating and editing factual associations in GPT (NeurIPS 2022) ☆727 · Apr 20, 2024 · Updated last year
- ☆115 · May 7, 2025 · Updated 9 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search" ☆66 · Oct 10, 2023 · Updated 2 years ago
- Self-Alignment with Principle-Following Reward Models ☆169 · Sep 18, 2025 · Updated 4 months ago
- ☆38 · Jul 24, 2025 · Updated 6 months ago
- Few-shot Learning with Auxiliary Data ☆31 · Dec 8, 2023 · Updated 2 years ago
- Editing Models with Task Arithmetic ☆532 · Jan 11, 2024 · Updated 2 years ago