VITA-Group / o1-planningLinks

On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability

☆40

Alternatives and similar repositories for o1-planning

Users that are interested in o1-planning are comparing it to the libraries listed below

Sorting:

yale-nlp / refdpo
☆16Updated last year
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆86Updated last year
DualityRL / multi-attempt
☆19Updated 7 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 7 months ago
cognitiveailab / GPT-simulator
☆29Updated last year
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆35Updated last month
LAMDASZ-ML / Self-Backtracking
☆49Updated 8 months ago
Gen-Verse / CURE
[NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆122Updated 3 weeks ago
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆105Updated 7 months ago
facebookresearch / dualformer
implementation of dualformer
☆20Updated 7 months ago
neulab / MultiUI
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
☆52Updated 9 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆37Updated last year
sunblaze-ucb / omega
☆40Updated 3 months ago
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆46Updated 7 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆121Updated last year
cvenhoff / steering-thinking-llms
☆27Updated 3 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆30Updated 2 months ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28Updated this week
csinva / tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆41Updated last year
locuslab / scaling_laws_data_filtering
☆65Updated last year
Wang-ML-Lab / multimodal-needle-in-a-haystack
[NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models
☆49Updated 5 months ago
microsoft / LEMA
official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"
☆58Updated last year
NuoJohnChen / JudgeLRM
JudgeLRM: Large Reasoning Models as a Judge
☆39Updated 3 weeks ago
THUDM / Self-Contrast
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
☆20Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated last year
LCM-Lab / LCM_Stack
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆13Updated 7 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆40Updated 7 months ago
open-compass / CriticEval
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs
☆46Updated 10 months ago
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year