veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆157 Updated 9 months ago
Alternatives and similar repositories for Faithful-COT:
Users who are interested in Faithful-COT are comparing it to the repositories listed below.
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought" ☆94 Updated last year
- Inspecting and Editing Knowledge Representations in Language Models ☆112 Updated last year
- ☆172 Updated last year
- Self-Alignment with Principle-Following Reward Models ☆154 Updated 11 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated 11 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆135 Updated 3 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆229 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆93 Updated last year
- Repository for Decomposed Prompting ☆84 Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆77 Updated 6 months ago
- ☆117 Updated 4 months ago
- The corresponding code from our paper "REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t… ☆70 Updated last year
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆150 Updated 11 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆97 Updated last year
- ☆64 Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆162 Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆151 Updated last year
- ☆160 Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆125 Updated 11 months ago
- Supporting code for ReCEval paper ☆28 Updated 5 months ago
- Data and Code for Program of Thoughts (TMLR 2023) ☆259 Updated 9 months ago
- ☆114 Updated 7 months ago
- ☆111 Updated 7 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆58 Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆65 Updated 10 months ago
- ☆173 Updated 6 months ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆86 Updated 9 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆108 Updated last year
- ☆85 Updated last year
- Token-level Reference-free Hallucination Detection ☆94 Updated last year