sunblaze-ucb / reasoning_ladderView external linksLinks
☆35May 16, 2025Updated 8 months ago
Alternatives and similar repositories for reasoning_ladder
Users that are interested in reasoning_ladder are comparing it to the libraries listed below
Sorting:
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning".☆20Feb 26, 2025Updated 11 months ago
- ☆20Mar 25, 2025Updated 10 months ago
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 8 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- ☆16Feb 22, 2025Updated 11 months ago
- ☆14Mar 20, 2025Updated 10 months ago
- ☆29Nov 9, 2025Updated 3 months ago
- ☆33Nov 18, 2025Updated 2 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- Exploration of automated dataset selection approaches at large scales.☆52Mar 4, 2025Updated 11 months ago
- ☆16Sep 4, 2025Updated 5 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆14Jun 6, 2025Updated 8 months ago
- ☆21Jul 21, 2025Updated 6 months ago
- ☆19Jun 4, 2025Updated 8 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 7 months ago
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆38Jun 24, 2025Updated 7 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Oct 17, 2025Updated 3 months ago
- CS194-196 Course Project☆14Feb 20, 2025Updated 11 months ago
- The dataset consists of public social media url pairs and the corresponding entailment label for an external conference (ACL 2021). Each …☆14Aug 16, 2021Updated 4 years ago
- 🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence☆26May 21, 2025Updated 8 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆27Feb 25, 2025Updated 11 months ago
- A tool for an analysis of LLM generations.☆42Oct 13, 2025Updated 4 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 10 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆27Mar 1, 2025Updated 11 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆129Jul 24, 2025Updated 6 months ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆18Mar 13, 2025Updated 11 months ago
- A simple GUI utility for gathering LIMA-like chat data.☆23Oct 6, 2025Updated 4 months ago
- The code implementation of Symbolic-MoE☆46Sep 2, 2025Updated 5 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated 11 months ago
- ☆50Jan 28, 2025Updated last year
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆74Apr 22, 2025Updated 9 months ago
- A Sober Look at Language Model Reasoning☆92Nov 18, 2025Updated 2 months ago
- The official implementation of dual-view molecule pre-training.☆43Nov 22, 2021Updated 4 years ago
- ☆21May 3, 2025Updated 9 months ago
- ☆22Feb 12, 2025Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆26Oct 14, 2025Updated 4 months ago