Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
☆32Jun 25, 2025Updated 9 months ago
Alternatives and similar repositories for acl2025-diverse-cot
Users that are interested in acl2025-diverse-cot are comparing it to the libraries listed below.
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆14Oct 4, 2024Updated last year
- NeurIPS 2025 Poster☆26Feb 4, 2025Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆329Jan 29, 2026Updated 2 months ago
- TextPy: Collaborative Agent Workflow through Programming and Prompting☆27May 9, 2025Updated 10 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- ☆22May 7, 2025Updated 10 months ago
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆18Mar 4, 2025Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- An extension to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- Repo for paper: Controllable Text Generation with Language Constraints☆20Jun 20, 2023Updated 2 years ago
- MUA-RL: Multi-Turn User-Interacting Agent Reinforcement Learning for Agentic Tool Use☆58Nov 5, 2025Updated 4 months ago
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- ☆16Jun 10, 2025Updated 9 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆187May 20, 2025Updated 10 months ago
- ☆23Mar 8, 2024Updated 2 years ago
- [NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning☆25Sep 21, 2025Updated 6 months ago
- A client-only OpenAI LLM Playground for prototyping agents without writing any code.☆22Aug 31, 2023Updated 2 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- Android releases of Clubhouse App☆14Apr 9, 2021Updated 4 years ago
- Examples for the Spartan HPC cluster.☆10Sep 2, 2019Updated 6 years ago
- [ACL 2023] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.☆10Dec 19, 2024Updated last year
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- EARAM for fake news detection☆14Dec 30, 2025Updated 2 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"☆12Jun 11, 2025Updated 9 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆188Jun 25, 2025Updated 9 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 3 years ago
- A Qt client UI framework ported from PyDracula☆11May 10, 2022Updated 3 years ago
- Evaluate the Quality of Critique☆36Jun 1, 2024Updated last year
- study notes for IT☆11Feb 22, 2020Updated 6 years ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated 11 months ago
- ☆31Aug 27, 2024Updated last year
- Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"☆16Jul 5, 2024Updated last year
- Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.☆18Dec 10, 2024Updated last year
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 5 months ago
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents"☆15Oct 8, 2024Updated last year