ZBox1005 / CoT-UQ
[arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆15 Updated 8 months ago
Alternatives and similar repositories for CoT-UQ
Users who are interested in CoT-UQ are comparing it to the repositories listed below.
- Code for Heima ☆58 Updated 7 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆40 Updated this week
- ☆142 Updated 7 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆54 Updated 2 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆49 Updated last year
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆74 Updated 5 months ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? ☆82 Updated 10 months ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆17 Updated 9 months ago
- Discriminative Constrained Optimization for Reinforcing Large Reasoning Models ☆46 Updated last month
- ☆134 Updated 9 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆86 Updated 5 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆70 Updated 6 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆39 Updated last year
- ☆38 Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆123 Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆88 Updated 10 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆69 Updated 5 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆70 Updated 6 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] ☆50 Updated 10 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think ☆245 Updated 2 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆70 Updated last year
- ☆89 Updated this week
- Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆86 Updated 2 months ago
- Research works from Tencent AI Lab regarding self-evolving agents ☆70 Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 10 months ago
- ☆30 Updated 3 weeks ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆34 Updated last year
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 Updated 5 months ago
- ☆15 Updated 7 months ago
- Official Repository of LatentSeek ☆70 Updated 6 months ago