causalNLP / corr2causeLinks
Data and code for the Corr2Cause paper (ICLR 2024)
☆108Updated last year
Alternatives and similar repositories for corr2cause
Users that are interested in corr2cause are comparing it to the libraries listed below
Sorting:
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆119Updated last year
- ☆96Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆122Updated 8 months ago
- The Prism Alignment Project☆79Updated last year
- ☆51Updated 3 months ago
- ☆72Updated last year
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆75Updated last year
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Updated 5 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆125Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆113Updated last year
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆37Updated 2 years ago
- augmented LLM with self reflection☆129Updated last year
- ☆96Updated last year
- ☆99Updated last year
- ☆52Updated 2 years ago
- ☆32Updated last year
- Code/data for MARG (multi-agent review generation)☆46Updated 8 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆182Updated 5 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆108Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆96Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- [NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models☆100Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆68Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆151Updated 5 months ago
- Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".☆40Updated 8 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆112Updated last month
- ☆89Updated 11 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year