β80Nov 19, 2024Updated last year
Alternatives and similar repositories for inference_scaling
Users that are interested in inference_scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Code Repository for [AutoScaleπ: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*β¦β14Aug 8, 2025Updated 9 months ago
- β21Jun 27, 2024Updated last year
- β110Jul 15, 2025Updated 10 months ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and trainingβ287May 26, 2024Updated 2 years ago
- β15Mar 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automatic prompt optimization framework for multi-step agent tasks.β37Nov 12, 2024Updated last year
- β44Dec 14, 2024Updated last year
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inferenceβ43Mar 28, 2026Updated 2 months ago
- The rule-based evaluation subset and code implementation of Omni-MATHβ27Dec 23, 2024Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied witβ¦β157Jul 12, 2024Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".β17Sep 15, 2024Updated last year
- A series of technical report on Slow Thinking with LLMβ765Aug 13, 2025Updated 9 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low aβ¦β28Feb 14, 2025Updated last year
- β24Dec 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β20Nov 3, 2024Updated last year
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengthsβ19Jul 10, 2025Updated 10 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]β32Jan 23, 2025Updated last year
- β23Mar 7, 2025Updated last year
- [TMLR] Process Reward Models That Thinkβ89Nov 29, 2025Updated 6 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capabilityβ14Mar 11, 2025Updated last year
- Repo of paper "Free Process Rewards without Process Labels"β171Mar 14, 2025Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]β21May 2, 2024Updated 2 years ago
- β52Mar 17, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasksβ268May 5, 2025Updated last year
- Repository for the paper Stream of Search: Learning to Search in Languageβ153Feb 3, 2025Updated last year
- [NeurIPS'24] Official code for *π―DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*β122Dec 10, 2024Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β331Jan 29, 2026Updated 4 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processorβ32Apr 8, 2024Updated 2 years ago
- [EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoningβ50Oct 11, 2024Updated last year
- β969Jan 23, 2025Updated last year
- β30May 30, 2025Updated 11 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.β86Nov 2, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β114Sep 25, 2024Updated last year
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelangβ44Nov 19, 2025Updated 6 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)β59Oct 10, 2025Updated 7 months ago
- [ICLR 2025] 𧬠RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)β193Feb 17, 2025Updated last year
- β145May 6, 2025Updated last year
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectβ¦β131Jan 31, 2026Updated 3 months ago
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AIβ31Sep 25, 2024Updated last year