ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
β195Updated this week
Alternatives and similar repositories for Awesome-Inference-Time-Scaling:
Users that are interested in Awesome-Inference-Time-Scaling are comparing it to the libraries listed below
- π A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyondβ191Updated this week
- [Arxiv 2025] Efficient Reasoning Models: A Surveyβ107Updated this week
- Paper list for Efficient Reasoning.β403Updated this week
- A Survey on Efficient Reasoning for LLMsβ332Updated this week
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Modelsβ87Updated 2 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"β144Updated last month
- β93Updated last week
- A RLHF Infrastructure for Vision-Language Modelsβ171Updated 5 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuningβ188Updated 4 months ago
- β90Updated 3 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMsβ133Updated last month
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"β51Updated this week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β72Updated this week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ65Updated 2 months ago
- [EMNLP 2024 Findingsπ₯] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inβ¦β92Updated 5 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Methodβ156Updated 8 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Modelsβ117Updated last week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learningβ191Updated last month
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!β49Updated 3 weeks ago
- Awesome RL-based LLM Reasoningβ450Updated last week
- π This is a repository for organizing papers, codes, and other resources related to unified multimodal models.β173Updated 2 weeks ago
- Paper collections of multi-modal LLM for Math/STEM/Code.β88Updated this week
- A comprehensive collection of process reward models.β67Updated this week
- Latest Advances on Long Chain-of-Thought Reasoningβ241Updated last week
- β93Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It containsβ¦β190Updated last week
- β48Updated 4 months ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"β107Updated this week
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understandingβ75Updated 3 weeks ago
- β99Updated 9 months ago