ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆127Updated last week
Alternatives and similar repositories for Awesome-Inference-Time-Scaling:
Users that are interested in Awesome-Inference-Time-Scaling are comparing it to the libraries listed below
- Paper list for Efficient Reasoning.☆311Updated this week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆55Updated last month
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆65Updated this week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆41Updated 2 weeks ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆98Updated 2 weeks ago
- A Survey on Efficient Reasoning for LLMs☆116Updated this week
- ☆48Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆153Updated this week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆54Updated 5 months ago
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆138Updated 3 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆148Updated last week
- ☆71Updated last week
- A RLHF Infrastructure for Vision-Language Models☆167Updated 4 months ago
- The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark"☆45Updated 2 months ago
- ☆21Updated 3 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆33Updated 8 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆92Updated 4 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆54Updated 7 months ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆83Updated last week
- SOTA RL fine-tuning solution for advanced math reasoning of LLM☆91Updated this week
- A Survey on the Honesty of Large Language Models☆56Updated 3 months ago
- Awesome RL-based LLM Reasoning☆341Updated this week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆69Updated 4 months ago
- ☆131Updated 8 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆119Updated 8 months ago
- The official code repository for PRMBench.☆68Updated last month
- ☆64Updated 9 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 3 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆72Updated 7 months ago
- A Self-Training Framework for Vision-Language Reasoning☆71Updated 2 months ago