β79Nov 19, 2024Updated last year
Alternatives and similar repositories for inference_scaling
Users that are interested in inference_scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β13Aug 8, 2025Updated 7 months ago
- β21Jun 27, 2024Updated last year
- β109Jul 15, 2025Updated 8 months ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and trainingβ285May 26, 2024Updated last year
- β15Mar 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inferenceβ31Mar 19, 2026Updated last week
- Automatic prompt optimization framework for multi-step agent tasks.β37Nov 12, 2024Updated last year
- β39Dec 14, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATHβ27Dec 23, 2024Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witβ¦β153Jul 12, 2024Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".β16Sep 15, 2024Updated last year
- A series of technical report on Slow Thinking with LLMβ763Aug 13, 2025Updated 7 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low aβ¦β26Feb 14, 2025Updated last year
- β24Dec 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- β20Nov 3, 2024Updated last year
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengthsβ17Jul 10, 2025Updated 8 months ago
- [TMLR] Process Reward Models That Thinkβ83Nov 29, 2025Updated 4 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- β23Mar 7, 2025Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capabilityβ14Mar 11, 2025Updated last year
- Repo of paper "Free Process Rewards without Process Labels"β170Mar 14, 2025Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]β21May 2, 2024Updated last year
- β52Mar 17, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ265May 5, 2025Updated 10 months ago
- Repository for the paper Stream of Search: Learning to Search in Languageβ154Feb 3, 2025Updated last year
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β121Dec 10, 2024Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β329Jan 29, 2026Updated 2 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processorβ31Apr 8, 2024Updated last year
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoningβ50Oct 11, 2024Updated last year
- β969Jan 23, 2025Updated last year
- β27May 30, 2025Updated 9 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.β86Nov 2, 2025Updated 4 months ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- β112Sep 25, 2024Updated last year
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelangβ44Nov 19, 2025Updated 4 months ago
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β187Feb 17, 2025Updated last year
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)β58Oct 10, 2025Updated 5 months ago
- β145May 6, 2025Updated 10 months ago
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectβ¦β134Jan 31, 2026Updated last month
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersβ86May 21, 2025Updated 10 months ago