UKPLab/acl2025-diverse-cot

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UKPLab/acl2025-diverse-cot)

UKPLab / acl2025-diverse-cot

Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"

☆32

Alternatives and similar repositories for acl2025-diverse-cot

Users that are interested in acl2025-diverse-cot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YuxiXie / MCTS-DPO
View on GitHub
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331Jan 29, 2026Updated 6 months ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
Xt-cyh / CoDI-Eval
View on GitHub
☆22May 7, 2025Updated last year
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
wenhycs / EMNLP2021-Utilizing-Relative-Event-Time-to-Enhance-Event-Event-Temporal-Relation-Extraction
View on GitHub
☆12Oct 4, 2021Updated 4 years ago
jinpz / q_sharp
View on GitHub
The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
☆20Mar 4, 2025Updated last year
QwenLM / ProcessBench
View on GitHub
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆190May 20, 2025Updated last year
mtharrison / promptscaper
View on GitHub
A client-only OpenAI LLM Playground for prototyping agents without writing any code.
☆22Aug 31, 2023Updated 2 years ago
facebookresearch / iclmlp
View on GitHub
Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"
☆18May 29, 2023Updated 3 years ago
ZhaozwTD / MMCAN
View on GitHub
Source code of "Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection"
☆14Nov 17, 2023Updated 2 years ago
resbaz / spartan-examples
View on GitHub
Examples for the Spartan HPC cluster.
☆10Sep 2, 2019Updated 6 years ago
Miaoranmmm / SelfChecker
View on GitHub
codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"
☆12Feb 10, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Gringham / explainable-metrics-machine-translation
View on GitHub
explainable-machine-translation-metrics
☆12Jul 15, 2022Updated 4 years ago
qingpingwan / EARAM
View on GitHub
EARAM for fake news detection
☆14Dec 30, 2025Updated 6 months ago
tamohannes / urartu
View on GitHub
Build ML pipelines with smart caching and remote execution. Develop locally, deploy to HPC clusters instantly. Track with Aim. 🎯
☆13Feb 10, 2026Updated 5 months ago
xhan77 / context-aware-decoding
View on GitHub
☆58Nov 18, 2024Updated last year
GAIR-NLP / MetaCritique
View on GitHub
Evaluate the Quality of Critique
☆37Jun 1, 2024Updated 2 years ago
tianyi-lab / Superfiltering
View on GitHub
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189Jun 25, 2025Updated last year
HKBUNLP / Mr.Harm-EMNLP2023
View on GitHub
Code for our EMNLP 2023 paper - Beneath the Surface: Unveiling Harmful Memes with Multimodal Reasoning Distilled from Large Language Mode…
☆15May 5, 2024Updated 2 years ago
sastpg / RFTT
View on GitHub
RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 5 months ago
xiaoyuisrain / metaphor-understanding-challenge
View on GitHub
☆24Mar 8, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
genglinliu / UnknownBench
View on GitHub
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
☆14Feb 20, 2024Updated 2 years ago
slashml / awesome-finetuning
View on GitHub
☆31Aug 27, 2024Updated last year
wellecks / naturalprover
View on GitHub
NaturalProver: Grounded Mathematical Proof Generation with Language Models
☆40Mar 24, 2023Updated 3 years ago
Aries-iai / Manifold_Steering
View on GitHub
The official implementation for "Mitigating Overthinking in Large Reasoning Models via Manifold Steering"
☆15May 29, 2025Updated last year
MLLM-Data-Contamination / MM-Detect
View on GitHub
Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings
☆18Oct 17, 2025Updated 9 months ago
tilde-research / comp-muon-release
View on GitHub
Compositional Muon release
☆23Jun 5, 2026Updated last month
UKPLab / tmlr2026-manifold-analysis
View on GitHub
☆21Mar 3, 2026Updated 4 months ago
kite99520 / DialSummEval
View on GitHub
Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"
☆14Jul 22, 2025Updated last year
Tomsawyerhu / LRP4RAG
View on GitHub
RAG Hallucination Detecting By LRP.
☆12Mar 31, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
nihalsid / shape_sdf
View on GitHub
Simple MLP for representing the SDF of a single shape
☆17Jun 30, 2023Updated 3 years ago
keing1 / reward-hack-generalization
View on GitHub
Datasets used in the paper "Reward hacking behavior can generalize across tasks"
☆15Aug 17, 2025Updated 11 months ago
Sahandfer / EmoBench
View on GitHub
[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models
☆117May 16, 2025Updated last year
RUCKBReasoning / CodeRM
View on GitHub
Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'
☆27May 16, 2025Updated last year
prafulla77 / Discourse_Profiling
View on GitHub
☆23Sep 21, 2020Updated 5 years ago
RUCAIBox / HaluAgent
View on GitHub
☆23Jul 1, 2024Updated 2 years ago
wyzjack / CNTP
View on GitHub
[ACL 2025] Cautious Next Token Prediction
☆16Jul 24, 2025Updated last year