The related works and background techniques about Openai o1
ā222Jan 7, 2025Updated last year
Alternatives and similar repositories for Awesome-LLM-Reasoning-Openai-o1-Survey
Users that are interested in Awesome-LLM-Reasoning-Openai-o1-Survey are comparing it to the libraries listed below
Sorting:
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 š and reasoning techniques.ā6,898Dec 17, 2025Updated 3 months ago
- O1 Replication Journeyā1,999Jan 14, 2025Updated last year
- Large Reasoning Modelsā807Dec 3, 2024Updated last year
- A bibliography and survey of the papers surrounding o1ā1,212Nov 16, 2024Updated last year
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Modelsā1,837Jan 17, 2025Updated last year
- ā17Nov 3, 2024Updated last year
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 šā3,568May 7, 2025Updated 10 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningā123May 6, 2025Updated 10 months ago
- A series of technical report on Slow Thinking with LLMā761Aug 13, 2025Updated 7 months ago
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"ā190Oct 28, 2024Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.ā12Oct 12, 2024Updated last year
- GenRM-CoT: Data release for verification rationalesā67Oct 16, 2024Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)ā694Jan 20, 2025Updated last year
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)ā9,191Updated this week
- [ICML'2024] Can AI Assistants Know What They Don't Know?ā85Feb 5, 2024Updated 2 years ago
- ā1,347Nov 21, 2024Updated last year
- Scaling Agentic Environments Automatically.ā54Jan 22, 2026Updated last month
- Interpretable Contrastive Monte Carlo Tree Search Reasoningā51Nov 9, 2024Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"ā18Oct 26, 2024Updated last year
- Simple RL training for reasoningā3,841Dec 23, 2025Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoningā88Jan 23, 2025Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".ā55Nov 29, 2024Updated last year
- ā88Jun 1, 2023Updated 2 years ago
- ā15Jul 9, 2025Updated 8 months ago
- An Open Large Reasoning Model for Real-World Solutionsā1,539Feb 13, 2026Updated last month
- ā342Jun 5, 2025Updated 9 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open Dā¦ā12Sep 16, 2022Updated 3 years ago
- Codebase for multilingual neural machine translationā13Nov 24, 2022Updated 3 years ago
- Paper list for Efficient Reasoning.ā856Updated this week
- Code for Research Project TLDRā25Jul 28, 2025Updated 7 months ago
- Official Repo for Open-Reasoner-Zeroā2,086Jun 2, 2025Updated 9 months ago
- Latest Advances on System-2 Reasoningā1,339Jun 8, 2025Updated 9 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"ā66Feb 5, 2025Updated last year
- ā553Jan 2, 2025Updated last year
- GUICourse: From General Vision Langauge Models to Versatile GUI Agentsā136Mar 1, 2026Updated 2 weeks ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)ā66Oct 18, 2024Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"ā160Oct 23, 2025Updated 4 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learningā58Oct 16, 2025Updated 5 months ago
- verl: Volcano Engine Reinforcement Learning for LLMsā19,919Updated this week