Xuchen-Li / llm-arxiv-daily
Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using GitHub Actions.
☆33 Updated this week
Alternatives and similar repositories for llm-arxiv-daily:
Users who are interested in llm-arxiv-daily are comparing it to the repositories listed below.
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆118 Updated last week
- A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond ☆42 Updated last week
- ☆138 Updated 3 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆162 Updated 2 weeks ago
- ☆85 Updated 3 weeks ago
- ☆82 Updated 2 weeks ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆78 Updated 3 weeks ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆66 Updated last week
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆63 Updated last month
- ☆18 Updated 3 weeks ago
- ☆59 Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆67 Updated last month
- A Survey on Efficient Reasoning for LLMs ☆204 Updated last week
- ☆84 Updated last month
- Paper List of Inference/Test Time Scaling/Computing ☆131 Updated last week
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆107 Updated 2 weeks ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆58 Updated 5 months ago
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models ☆82 Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 4 months ago
- ☆107 Updated last month
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning ☆84 Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆95 Updated 2 months ago
- Paper list for Efficient Reasoning. ☆331 Updated this week
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering ☆58 Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆75 Updated 2 weeks ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆45 Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆56 Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆167 Updated 2 weeks ago
- ☆17 Updated last month
- ☆55 Updated last month