MaybeLizzy / UGBench
☆31 Updated 4 months ago
Alternatives and similar repositories for UGBench
Users interested in UGBench are comparing it to the repositories listed below.
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient ☆52 Updated last month
- Paper List of Inference/Test Time Scaling/Computing ☆301 Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆85 Updated 6 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task ☆35 Updated 4 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin". ☆25 Updated last month
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 Updated 2 months ago
- A Collection of Papers on Diffusion Language Models ☆119 Updated last week
- Code for CVPR 2024 Oral "Neural Lineage" ☆17 Updated last year
- Code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization" ☆59 Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆96 Updated last month
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs" ☆61 Updated last month
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024) ☆49 Updated 7 months ago
- ☆24 Updated 4 months ago
- Official implementation of MIA-DPO ☆65 Updated 7 months ago
- Data distillation benchmark ☆68 Updated 2 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation ☆23 Updated last month
- EMPO, A Fully Unsupervised RLVR Method ☆65 Updated last week
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆86 Updated 3 weeks ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen… ☆80 Updated 2 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmark ☆30 Updated 2 months ago
- The code repository of UniRL ☆38 Updated 3 months ago
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning ☆56 Updated last month
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph. ☆27 Updated 3 weeks ago
- Fast-Slow Thinking for Large Vision-Language Model Reasoning ☆17 Updated 4 months ago
- ☆164 Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆79 Updated last year
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints ☆72 Updated last month
- [ICLR'25] PiCO: Peer Review in LLMs based on the Consistency Optimization, https://arxiv.org/pdf/2402.01830 ☆36 Updated 6 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception. ☆37 Updated 2 months ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues ☆41 Updated 3 months ago