☆310Nov 17, 2023Updated 2 years ago
Alternatives and similar repositories for reversal_curse
Users that are interested in reversal_curse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'☆14Aug 2, 2024Updated last year
- Measuring the situational awareness of language models☆41Feb 12, 2024Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 4 years ago
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- ☆13Apr 24, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The Effect of Sampling Temperature on Problem Solving in Large Language Models☆25Nov 25, 2024Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆43Sep 3, 2024Updated last year
- ☆16Mar 22, 2025Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆323Dec 20, 2023Updated 2 years ago
- Scaling Data-Constrained Language Models☆342Jun 28, 2025Updated 11 months ago
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆90Oct 1, 2024Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆123Aug 16, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Jun 19, 2024Updated 2 years ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆34Jun 18, 2025Updated last year
- ☆30Mar 11, 2025Updated last year
- ☆17Dec 21, 2023Updated 2 years ago
- Parkar and Kim et al.'s paper on Can LLMs Select Important Instructions to Annotate?"☆13Jul 4, 2024Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Editing Models with Task Arithmetic☆545Jan 11, 2024Updated 2 years ago
- ☆121May 7, 2025Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆536Apr 24, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Apr 26, 2023Updated 3 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆727Jun 2, 2026Updated 2 weeks ago
- ☆30Jun 19, 2023Updated 3 years ago
- CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022)☆11Dec 7, 2022Updated 3 years ago
- ☆46Feb 8, 2024Updated 2 years ago
- ☆58Jun 15, 2023Updated 3 years ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13May 7, 2023Updated 3 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- ☆37Feb 20, 2025Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,839Jun 17, 2025Updated last year
- ☆165Nov 23, 2024Updated last year
- Locating and editing factual associations in GPT (NeurIPS 2022)☆764Apr 20, 2024Updated 2 years ago
- ☆37May 28, 2023Updated 3 years ago
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".☆10Jul 5, 2021Updated 4 years ago