Paper List of Inference/Test Time Scaling/Computing
☆355Feb 27, 2026Updated last week
Alternatives and similar repositories for Awesome-Inference-Time-Scaling
Users that are interested in Awesome-Inference-Time-Scaling are comparing it to the libraries listed below
Sorting:
- [ACM Computing Surveys] The collection of awesome papers on alignment of diffusion models.☆405Feb 6, 2026Updated last month
- ☆56Mar 6, 2025Updated last year
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,360Feb 26, 2026Updated last week
- This is a repo to track the latest autoregressive visual generation papers.☆431Jun 25, 2025Updated 8 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Aug 16, 2025Updated 6 months ago
- Paper list for Efficient Reasoning.☆828Feb 25, 2026Updated last week
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- ☆19Mar 10, 2025Updated 11 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆743Feb 28, 2026Updated last week
- Latest Advances on System-2 Reasoning☆1,329Jun 8, 2025Updated 8 months ago
- [TMLR 2025] Efficient Reasoning Models: A Survey☆300Feb 3, 2026Updated last month
- [TMLR 2025🔥] A survey for the autoregressive models in vision.☆787Nov 8, 2025Updated 3 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆443Aug 8, 2025Updated 6 months ago
- Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs☆11Jul 17, 2019Updated 6 years ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓☆3,554May 7, 2025Updated 10 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,649Feb 26, 2026Updated last week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆310Sep 28, 2025Updated 5 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆800Oct 10, 2025Updated 4 months ago
- ☆20Jun 9, 2025Updated 8 months ago
- ☆191Jan 14, 2025Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆214Jun 26, 2025Updated 8 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆75Jan 26, 2026Updated last month
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆31Oct 18, 2025Updated 4 months ago
- s1: Simple test-time scaling☆6,636Jun 25, 2025Updated 8 months ago
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated 8 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35May 8, 2025Updated 9 months ago
- A fork to add multimodal model training to open-r1☆1,493Feb 8, 2025Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆54Mar 9, 2025Updated 11 months ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,346Feb 3, 2026Updated last month
- collection of diffusion model papers categorized by their subareas☆2,161Updated this week
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆857May 23, 2025Updated 9 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆556Jan 4, 2025Updated last year
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,887Jan 8, 2026Updated last month
- Binary neural networks developed by Huawei Noah's Ark Lab☆29Feb 19, 2021Updated 5 years ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆130May 22, 2025Updated 9 months ago
- ☆46Dec 30, 2024Updated last year
- ☆1,137Nov 20, 2025Updated 3 months ago