[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆87Mar 23, 2025Updated 11 months ago
Alternatives and similar repositories for SuperCorrect-llm
Users that are interested in SuperCorrect-llm are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆674Jun 28, 2025Updated 8 months ago
- [NeurIPS 2025 Spotlight] LLM post-training suite — featuring ReasonFlux, ReasonFlux-PRM, and ReasonFlux-Coder.☆521Sep 27, 2025Updated 5 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Oct 1, 2024Updated last year
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆46Aug 22, 2025Updated 6 months ago
- ☆123Feb 21, 2025Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- ☆19Mar 10, 2025Updated 11 months ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- ☆42Dec 16, 2025Updated 2 months ago
- ☆30Mar 11, 2025Updated 11 months ago
- ☆25Aug 23, 2024Updated last year
- Implementation of the Pairformer model used in AlphaFold 3☆14Updated this week
- ☆15Jul 22, 2024Updated last year
- LLM as World Models using Bayesian inference☆16May 27, 2025Updated 9 months ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 6 months ago
- ☆26Jan 4, 2026Updated 2 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆114Feb 3, 2026Updated last month
- ☆968Jan 23, 2025Updated last year
- Berkeley Single Cell Computational Microscopy dataset☆18Oct 27, 2025Updated 4 months ago
- ☆12Jun 30, 2024Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 7 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆13Apr 12, 2025Updated 10 months ago
- Official implementation of "OpenCity3D: What do Vision-Language Models know about Urban Environments?" @ WACV2025☆16Nov 24, 2024Updated last year
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆459Apr 18, 2024Updated last year
- Code for Quiet-STaR☆741Aug 21, 2024Updated last year
- Repository for GeoUni, A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions.☆20Jun 12, 2025Updated 8 months ago
- FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data (NAACL 2025)☆15Jul 14, 2025Updated 7 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆66Oct 18, 2024Updated last year
- Examples for running TeNPy☆16Oct 31, 2025Updated 4 months ago
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆18Mar 4, 2025Updated last year
- ☆16Oct 27, 2024Updated last year
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 4 months ago
- Sequence-level 1F1B schedule for LLMs.☆19Jun 4, 2024Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Nov 27, 2024Updated last year
- ☆18Mar 25, 2024Updated last year