☆308 · Nov 17, 2023 · Updated 2 years ago
Alternatives and similar repositories for reversal_curse
Users interested in reversal_curse are comparing it to the repositories listed below.
- Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse' ☆13 · Aug 2, 2024 · Updated last year
- Measuring the situational awareness of language models ☆41 · Feb 12, 2024 · Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words ☆38 · May 26, 2022 · Updated 3 years ago
- A text-based game where language models learn to lie and to detect lies. ☆12 · Oct 4, 2023 · Updated 2 years ago
- ☆12 · Apr 24, 2024 · Updated last year
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- The Effect of Sampling Temperature on Problem Solving in Large Language Models ☆24 · Nov 25, 2024 · Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment ☆16 · Aug 6, 2024 · Updated last year
- ☆43 · Sep 3, 2024 · Updated last year
- ☆16 · Mar 22, 2025 · Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆319 · Dec 20, 2023 · Updated 2 years ago
- Scaling Data-Constrained Language Models ☆343 · Jun 28, 2025 · Updated 9 months ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?" ☆32 · Jun 18, 2025 · Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap ☆90 · Oct 1, 2024 · Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆121 · Aug 16, 2023 · Updated 2 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆72 · Jun 19, 2024 · Updated last year
- ☆30 · Mar 11, 2025 · Updated last year
- ☆17 · Dec 21, 2023 · Updated 2 years ago
- Parkar and Kim et al.'s paper on "Can LLMs Select Important Instructions to Annotate?" ☆13 · Jul 4, 2024 · Updated last year
- Adversarial Attack for Pre-trained Code Models ☆10 · Jul 19, 2022 · Updated 3 years ago
- Editing Models with Task Arithmetic ☆537 · Jan 11, 2024 · Updated 2 years ago
- ☆117 · May 7, 2025 · Updated 11 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆524 · Apr 24, 2025 · Updated 11 months ago
- ☆10 · Apr 26, 2023 · Updated 2 years ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 · May 26, 2025 · Updated 10 months ago
- Salesforce open-source LLMs with 8k sequence length. ☆726 · Jan 31, 2025 · Updated last year
- ☆30 · Jun 19, 2023 · Updated 2 years ago
- ☆46 · Feb 8, 2024 · Updated 2 years ago
- CoPur: Certifiably Robust Collaborative Inference via Feature Purification (NeurIPS 2022) ☆11 · Dec 7, 2022 · Updated 3 years ago
- ☆58 · Jun 15, 2023 · Updated 2 years ago
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning ☆13 · May 7, 2023 · Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆31 · Dec 8, 2023 · Updated 2 years ago
- Continual Memorization of Factoids in Large Language Models ☆12 · Nov 20, 2024 · Updated last year
- ☆160 · Nov 23, 2024 · Updated last year
- ☆35 · Feb 20, 2025 · Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,837 · Jun 17, 2025 · Updated 10 months ago
- Locating and editing factual associations in GPT (NeurIPS 2022) ☆744 · Apr 20, 2024 · Updated last year
- ☆37 · May 28, 2023 · Updated 2 years ago
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance". ☆10 · Jul 5, 2021 · Updated 4 years ago