UKPLab / arxiv2024-divergent-cot
Code for the 2024 arXiv publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models"
☆22Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for arxiv2024-divergent-cot
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆33Updated 10 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆62Updated 3 months ago
- ☆37Updated 10 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆57Updated last month
- [EMNLP 2024 Findings] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models☆19Updated last week
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆75Updated last month
- trending projects & awesome papers about data-centric llm studies.☆31Updated 2 weeks ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago
- Do Large Language Models Know What They Don’t Know?☆85Updated 2 weeks ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆118Updated 4 months ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆77Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆88Updated 2 months ago
- Code implementation of synthetic continued pretraining☆60Updated last month
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆83Updated 4 months ago
- ☆40Updated 11 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆46Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆50Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆49Updated last week
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆68Updated 5 months ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener …☆58Updated 4 months ago
- Evaluate the Quality of Critique☆35Updated 5 months ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆36Updated 5 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- The repository for paper <Evaluating Open-QA Evaluation>☆23Updated 7 months ago
- ☆66Updated 6 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆60Updated 8 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆22Updated 11 months ago