Open-Source-O1 / o1_Reasoning_Patterns_Study
☆102Updated 3 months ago
Alternatives and similar repositories for o1_Reasoning_Patterns_Study:
Users that are interested in o1_Reasoning_Patterns_Study are comparing it to the libraries listed below
- ☆91Updated 3 months ago
- Code implementation of synthetic continued pretraining☆94Updated 2 months ago
- Reformatted Alignment☆115Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆101Updated this week
- ☆60Updated 3 months ago
- ☆42Updated 3 months ago
- ☆52Updated 5 months ago
- ☆110Updated 2 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆66Updated 2 months ago
- ☆102Updated 2 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆66Updated 3 months ago
- ☆116Updated 9 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆53Updated 5 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆41Updated 8 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆46Updated 8 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆63Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆62Updated last month
- ☆49Updated last year
- ☆143Updated 3 months ago
- ☆85Updated this week
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆116Updated 4 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆117Updated 3 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆108Updated 2 weeks ago
- Fantastic Data Engineering for Large Language Models☆83Updated 2 months ago