UKPLab / arxiv2024-divergent-cot

Code for the 2024 arXiv publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models"

☆24

Alternatives and similar repositories for arxiv2024-divergent-cot:

Users that are interested in arxiv2024-divergent-cot are comparing it to the libraries listed below

RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆38Updated last year
ChengpengLi1003 / DotaMath
☆29Updated 4 months ago
starrYYxuan / LeCo
This the implementation of LeCo
☆31Updated 3 months ago
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆110Updated 9 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆48Updated 10 months ago
THUNLP-MT / SKR
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
☆26Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆47Updated 4 months ago
RUCAIBox / HaluEval-2.0
☆41Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
OpenLMLab / LongWanjuan
Towards Systematic Measurement for Long Text Quality
☆34Updated 8 months ago
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆55Updated last year
Zce1112zslx / IKE
☆41Updated last year
halfrot / ALaRM
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
☆25Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆34Updated 11 months ago
sunlab-osu / Understanding-CoT
☆86Updated last year
liyucheng09 / Contamination_Detector
Lightweight tool to identify Data Contamination in LLMs evaluation
☆50Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆60Updated 9 months ago
GAIR-NLP / OPO
☆49Updated last year
chujiezheng / LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
☆74Updated 10 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Updated last year
RUCKBReasoning / CoT-based-Synthesizer
The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"
☆24Updated 3 months ago
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆68Updated 2 years ago
tengxiaoliu / XoT
[EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
☆27Updated last year
WENGSYX / Self-Verification
We have released the code and demo program required for LLM with self-verification
☆59Updated last year
icip-cas / Verifier-Engineering
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆57Updated 5 months ago
RUCAIBox / BAMBOO
☆35Updated last year
hongbinye / Cognitive-Mirage-Hallucinations-in-LLMs
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
☆47Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆63Updated last year
LuLuLuyi / LongHeads
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆29Updated last year
feiyang-k / AutoScale
Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]
☆12Updated 3 months ago