ZBox1005 / CoT-UQ
[arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆14 · Updated 10 months ago
Alternatives and similar repositories for CoT-UQ
Users who are interested in CoT-UQ are comparing it to the repositories listed below.
- Code for Heima ☆59 · Updated 9 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards" ☆59 · Updated last month
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models." ☆52 · Updated last year
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆75 · Updated 7 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆41 · Updated 2 weeks ago
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers? ☆82 · Updated last year
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis. ☆179 · Updated 4 months ago
- NeurIPS 2025: Discriminative Constrained Optimization for Reinforcing Large Reasoning Models ☆50 · Updated last week
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆17 · Updated 11 months ago
- ☆79 · Updated last year
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] ☆50 · Updated last year
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆46 · Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to … ☆60 · Updated 2 weeks ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆129 · Updated 6 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning" ☆74 · Updated 9 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆69 · Updated last year
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆50 · Updated last week
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆35 · Updated last year
- GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025) ☆88 · Updated 4 months ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202… ☆40 · Updated 8 months ago
- The code of RouterDC ☆70 · Updated 9 months ago
- ☆104 · Updated last month
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆70 · Updated 7 months ago
- ☆144 · Updated 9 months ago
- ☆63 · Updated 6 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆41 · Updated last year
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆87 · Updated 10 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆85 · Updated 3 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆104 · Updated 4 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆97 · Updated 11 months ago