StevenZHB / CoT_Causal_Analysis
Repository of paper "LLMs with Chain-of-Thought Are Non-Causal Reasoners"
☆15, updated 7 months ago
Related projects
Alternatives and complementary repositories for CoT_Causal_Analysis
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆45, updated 7 months ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe) ☆21, updated 4 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆54, updated 11 months ago
- Methods and evaluation for aligning language models temporally ☆24, updated 8 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”. ☆24, updated last year
- Lightweight Adapting for Black-Box Large Language Models ☆18, updated 9 months ago
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆28, updated last month
- Restore safety in fine-tuned language models through task arithmetic ☆26, updated 7 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆50, updated 7 months ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases". ☆19, updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆75, updated last month
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆55, updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆60, updated 8 months ago
- Official code repository for the main conference paper in ACL2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I… ☆25, updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆66, updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39, updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆83, updated 4 months ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity. ☆57, updated 2 weeks ago
- Data and code for the Corr2Cause paper (ICLR 2024) ☆88, updated 7 months ago
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers" ☆157, updated 6 months ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach ☆30, updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆102, updated 2 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs ☆31, updated 9 months ago