β79Nov 19, 2024Updated last year
Alternatives and similar repositories for inference_scaling
Users that are interested in inference_scaling are comparing it to the libraries listed below
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β13Aug 8, 2025Updated 7 months ago
- β109Jul 15, 2025Updated 7 months ago
- β21Jun 27, 2024Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.β37Nov 12, 2024Updated last year
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and trainingβ285May 26, 2024Updated last year
- β20Nov 3, 2024Updated last year
- β23Mar 7, 2025Updated last year
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inferenceβ22Feb 9, 2026Updated last month
- β39Dec 14, 2024Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersβ86May 21, 2025Updated 9 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low aβ¦β26Feb 14, 2025Updated last year
- A series of technical report on Slow Thinking with LLMβ761Aug 13, 2025Updated 6 months ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Modelsβ12Nov 1, 2025Updated 4 months ago
- β14Mar 20, 2025Updated 11 months ago
- β34Nov 18, 2025Updated 3 months ago
- The rule-based evaluation subset and code implementation of Omni-MATHβ26Dec 23, 2024Updated last year
- β14Jan 24, 2025Updated last year
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoningβ57Oct 10, 2025Updated 4 months ago
- π Sliding Window Attention Training for Efficient Large Language Modelsβ16Dec 8, 2025Updated 3 months ago
- β11Mar 13, 2023Updated 2 years ago
- Computer Vision Fall 2019 by Chiu-San Fu@ CSIE NTU Taiwanβ10Jan 11, 2020Updated 6 years ago
- Repository for the paper Stream of Search: Learning to Search in Languageβ154Feb 3, 2025Updated last year
- β28May 24, 2025Updated 9 months ago
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β121Dec 10, 2024Updated last year
- The official implementation of InfoRM [NeurIPS 2024].β15Oct 25, 2025Updated 4 months ago
- β11Aug 13, 2024Updated last year
- β13Jan 22, 2025Updated last year
- β20Aug 14, 2025Updated 6 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickersβ57Mar 6, 2025Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMsβ17May 21, 2025Updated 9 months ago
- β52Mar 17, 2025Updated 11 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β185Feb 17, 2025Updated last year
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejectionβ55Oct 29, 2024Updated last year
- Dateset Reset Policy Optimizationβ31Apr 12, 2024Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β329Jan 29, 2026Updated last month
- [AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Dataβ33Apr 7, 2025Updated 11 months ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)β15Jul 20, 2023Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.β25Oct 7, 2025Updated 5 months ago