MaybeLizzy / UGBench

☆33

Alternatives and similar repositories for UGBench

Users who are interested in UGBench compare it to the repositories listed below

czg1225 / VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆53 Updated last week
haonan3 / V1
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36 Updated 5 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆86 Updated 7 months ago
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆36 Updated 3 months ago
ML-GSAI / Diffusion-LLM-Papers
A Collection of Papers on Diffusion Language Models
☆126 Updated last week
ThreeSR / Awesome-Inference-Time-Scaling
Paper List of Inference/Test Time Scaling/Computing
☆307 Updated 3 weeks ago
QingyangZhang / EMPO
EMPO, A Fully Unsupervised RLVR Method
☆66 Updated this week
mm-vl / ULM-R1
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆26 Updated 2 months ago
yu-rp / NeuralLineage
Code for CVPR 2024 Oral "Neural Lineage"
☆17 Updated last year
GAIR-NLP / thinking-with-generated-images
Doodling our way to AGI ✏️ 🖼️ 🧠
☆103 Updated 3 months ago
RainBowLuoCS / DEEM
(ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.
☆39 Updated 2 months ago
PKU-YuanGroup / AsFT
Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".
☆29 Updated 2 months ago
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆68 Updated 3 months ago
iboing / CorDA
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024)
☆50 Updated 8 months ago
Joshua-Ren / Learning_dynamics_LLM
☆167 Updated 4 months ago
pipilurj / bootstrapped-preference-optimization-BPO
Code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"
☆59 Updated last year
LiangrunFlora / Slow-Fast-Sampling
Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…
☆31 Updated 2 months ago
ZichenWen1 / DIJA
Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"
☆64 Updated last week
SUSTechBruce / LOOK-M
[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆101 Updated 10 months ago
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆98 Updated 2 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆155 Updated 3 months ago
Dongping-Chen / MLLM-Judge
[ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.
☆84 Updated 7 months ago
MingyuJ666 / Rope_with_LLM
[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…
☆80 Updated 3 months ago
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆167 Updated 2 weeks ago
MME-Benchmarks / MME-CoT
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆129 Updated last month
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆42 Updated last year
SUSTechBruce / SRPO_MLLMs
[NeurIPS 2025🔥] Main source code of SRPO framework.
☆83 Updated this week
MrZilinXiao / ProxyThinker
Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.
☆17 Updated this week
MikeWangWZHL / PAPO
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆85 Updated last month
NEUIR / PC-Sampler
☆17 Updated last week