Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
☆32Jun 25, 2025Updated 8 months ago
Alternatives and similar repositories for acl2025-diverse-cot
Users that are interested in acl2025-diverse-cot are comparing it to the libraries listed below
Sorting:
- Code and data release of the paper Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows☆14Oct 4, 2024Updated last year
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆18Mar 4, 2025Updated last year
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆57Nov 5, 2025Updated 4 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- Repo for paper: Controllable Text Generation with Language Constraints☆20Jun 20, 2023Updated 2 years ago
- ☆22May 7, 2025Updated 10 months ago
- ☆32Aug 11, 2025Updated 6 months ago
- NeurIPS 2025 Poster☆26Feb 4, 2025Updated last year
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated 9 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- PostgreSQL extension which allows to translate a given source SQL statement into another pre-defined SQL statement.☆24Sep 25, 2025Updated 5 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆329Jan 29, 2026Updated last month
- TextPy: Collaborative Agent Workflow through Programming and Prompting☆26May 9, 2025Updated 10 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆184May 20, 2025Updated 9 months ago
- ☆59Nov 17, 2025Updated 3 months ago
- Evaluate the Quality of Critique☆36Jun 1, 2024Updated last year
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Mar 24, 2023Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆154May 18, 2024Updated last year
- ☆52Oct 23, 2023Updated 2 years ago
- ☆12Sep 24, 2025Updated 5 months ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 8 months ago
- GBM implementation on Legate☆14Jan 28, 2026Updated last month
- ☆11Jul 17, 2023Updated 2 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 4 months ago
- ☆10Sep 29, 2024Updated last year
- Modifying Large Language Models Post-training for Diverse Creative Writing☆52May 12, 2025Updated 9 months ago
- ☆47Mar 25, 2025Updated 11 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Jul 25, 2024Updated last year
- DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models☆25May 23, 2024Updated last year
- ☆12May 23, 2024Updated last year
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- ☆15Dec 2, 2025Updated 3 months ago
- A Temporal Networks Library written in Python☆14Oct 13, 2021Updated 4 years ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- ☆17Dec 23, 2025Updated 2 months ago