Xuchen-Li / llm-arxiv-daily
Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using GitHub Actions.
☆109 Updated this week
Alternatives and similar repositories for llm-arxiv-daily
Users who are interested in llm-arxiv-daily are comparing it to the repositories listed below.
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond ☆289 Updated 3 weeks ago
- ☆262 Updated 2 months ago
- 🔧 Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆252 Updated this week
- Latest Advances on Long Chain-of-Thought Reasoning ☆492 Updated last month
- ☆147 Updated 3 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆154 Updated last week
- ☆164 Updated 3 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆204 Updated 4 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process! ☆69 Updated 5 months ago
- [arXiv 2025] Efficient Reasoning Models: A Survey ☆259 Updated this week
- ☆51 Updated 3 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆249 Updated 3 months ago
- Awesome Agent Training ☆216 Updated last month
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models. ☆165 Updated 2 months ago
- ☆283 Updated 3 months ago
- ☆67 Updated 2 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. ☆315 Updated last month
- This is a repository for organizing papers, codes, and other resources related to Latent Reasoning. ☆200 Updated 3 weeks ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆177 Updated 2 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation" ☆78 Updated 8 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆148 Updated 8 months ago
- Test-time preference optimization (ICML 2025). ☆162 Updated 4 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆80 Updated 2 months ago
- Extrapolating RLVR to General Domains without Verifiers ☆151 Updated 3 weeks ago
- ☆122 Updated 5 months ago
- Official Repository of "Learning what reinforcement learning can't" ☆65 Updated this week
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆106 Updated 3 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models ☆149 Updated 3 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆288 Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆85 Updated 6 months ago