thu-wyz / inference_scalingView external linksLinks
β78Nov 19, 2024Updated last year
Alternatives and similar repositories for inference_scaling
Users that are interested in inference_scaling are comparing it to the libraries listed below
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β13Aug 8, 2025Updated 6 months ago
- β109Jul 15, 2025Updated 7 months ago
- β21Jun 27, 2024Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.β36Nov 12, 2024Updated last year
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and trainingβ285May 26, 2024Updated last year
- β20Nov 3, 2024Updated last year
- β22Mar 7, 2025Updated 11 months ago
- β39Dec 14, 2024Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersβ86May 21, 2025Updated 8 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low aβ¦β26Feb 14, 2025Updated last year
- A series of technical report on Slow Thinking with LLMβ759Aug 13, 2025Updated 6 months ago
- β14Mar 20, 2025Updated 10 months ago
- β33Nov 18, 2025Updated 2 months ago
- The rule-based evaluation subset and code implementation of Omni-MATHβ26Dec 23, 2024Updated last year
- β14Jan 24, 2025Updated last year
- β11Mar 13, 2023Updated 2 years ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoningβ57Oct 10, 2025Updated 4 months ago
- π Sliding Window Attention Training for Efficient Large Language Modelsβ15Dec 8, 2025Updated 2 months ago
- Computer Vision Fall 2019 by Chiu-San Fu@ CSIE NTU Taiwanβ10Jan 11, 2020Updated 6 years ago
- Repository for the paper Stream of Search: Learning to Search in Languageβ153Feb 3, 2025Updated last year
- β28May 24, 2025Updated 8 months ago
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β120Dec 10, 2024Updated last year
- The official implementation of InfoRM [NeurIPS 2024].β14Oct 25, 2025Updated 3 months ago
- β11Aug 13, 2024Updated last year
- β52Mar 17, 2025Updated 10 months ago
- β13Jan 22, 2025Updated last year
- β17May 21, 2025Updated 8 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickersβ57Mar 6, 2025Updated 11 months ago
- β20Aug 14, 2025Updated 6 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β185Feb 17, 2025Updated 11 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejectionβ55Oct 29, 2024Updated last year
- Dateset Reset Policy Optimizationβ31Apr 12, 2024Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β329Jan 29, 2026Updated 2 weeks ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"β60Jun 3, 2024Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.β24Oct 7, 2025Updated 4 months ago
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengthsβ15Jul 10, 2025Updated 7 months ago
- [AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Dataβ33Apr 7, 2025Updated 10 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignmentβ16Dec 19, 2024Updated last year