ZBox1005 / CoT-UQLinks

[arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"

☆13

Alternatives and similar repositories for CoT-UQ

Users that are interested in CoT-UQ are comparing it to the libraries listed below

Sorting:

shawnricecake / Heima
Code for Heima
☆50Updated 2 months ago
sail-sg / FlowReasoner
☆126Updated 2 months ago
NuoJohnChen / JudgeLRM
☆30Updated 3 months ago
EvolvingLMMs-Lab / multimodal-sae
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
☆144Updated last week
cxcscmu / Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆46Updated 5 months ago
yihedeng9 / OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
☆93Updated last week
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆31Updated 3 weeks ago
NUS-TRAIL / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆78Updated last month
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆70Updated 3 weeks ago
VainF / Thinkless
[Preprint 2025] Thinkless: LLM Learns When to Think
☆201Updated 3 weeks ago
Geaming2002 / Ruler
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
☆38Updated 9 months ago
Luckfort / CD
[COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
☆79Updated 5 months ago
xufangzhi / Genius
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆64Updated last month
shuhao02 / RouterDC
The code of RouterDC
☆64Updated 3 months ago
yihedeng9 / STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆69Updated last year
CERT-Lab / lora-sb
Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning
☆49Updated last month
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆208Updated 6 months ago
hewei2001 / ReachQA
Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"
☆54Updated 8 months ago
luka-group / vlm-knowledge-conflict
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆42Updated 8 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆35Updated last month
ChengpengLi1003 / CoRT
☆44Updated 3 weeks ago
sail-sg / Rigging-ChatbotArena
Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)
☆21Updated 4 months ago
justarter / E2URec
Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…
☆37Updated 11 months ago
VILA-Lab / DELT
(CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…
☆23Updated 2 months ago
beichenzbc / BoostStep
official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"
☆36Updated 5 months ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆74Updated 3 weeks ago
zjunlp / unlearn
[ACL 2025] Knowledge Unlearning for Large Language Models
☆39Updated 2 months ago
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆113Updated 3 weeks ago
eric-ai-lab / Soft-Thinking
Official implementation of the paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"
☆184Updated this week
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆124Updated last month