MaybeLizzy / UGBench
☆29Updated this week
Alternatives and similar repositories for UGBench:
Users that are interested in UGBench are comparing it to the libraries listed below
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆67Updated 2 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated 3 weeks ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆58Updated 2 months ago
- ☆82Updated last month
- Data distillation benchmark☆58Updated this week
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆34Updated last month
- Codes for Merging Large Language Models☆29Updated 9 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆16Updated 10 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆55Updated 8 months ago
- Paper List of Inference/Test Time Scaling/Computing☆207Updated last week
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning(NeurIPS 2024)☆46Updated 3 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆53Updated 2 weeks ago
- ☆28Updated 3 weeks ago
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆35Updated 5 months ago
- [EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆93Updated 5 months ago
- ☆40Updated 4 months ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆12Updated 3 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆44Updated 5 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆39Updated 2 months ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆16Updated last month
- A Self-Training Framework for Vision-Language Reasoning☆77Updated 3 months ago
- ☆47Updated 5 months ago
- Official implement of MIA-DPO☆56Updated 3 months ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆48Updated last week
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆36Updated last month
- A Massive Multi-Discipline Lecture Understanding Benchmark☆16Updated this week
- ✈️ Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆66Updated last month
- ☆35Updated 10 months ago
- ☆95Updated 2 weeks ago
- ☆26Updated last week