EMMA-Bench / EMMALinks
☆56Updated 3 weeks ago
Alternatives and similar repositories for EMMA
Users that are interested in EMMA are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Official Implementation of GLIDER☆44Updated last week
- The official code repository for PRMBench.☆73Updated 3 months ago
- ☆100Updated last month
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆60Updated 5 months ago
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 4 months ago
- Code release for VTW (AAAI 2025) Oral☆43Updated 4 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆41Updated 2 weeks ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆57Updated 2 weeks ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆107Updated last month
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning☆35Updated 3 weeks ago
- A comprehensive collection of process reward models.☆85Updated 2 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆120Updated last week
- ☆19Updated last month
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆212Updated this week
- ☆83Updated last month
- repo for paper https://arxiv.org/abs/2504.13837☆144Updated last week
- The official code repository for the FullFront benchmark☆16Updated 3 weeks ago
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆120Updated last week
- ☆74Updated 11 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆41Updated 2 months ago
- ☆13Updated this week
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆19Updated 3 months ago
- ☆24Updated 3 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆112Updated last month
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆40Updated 6 months ago
- [arXiv] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs☆22Updated 2 weeks ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆51Updated last week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆53Updated 2 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆82Updated 5 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆21Updated 7 months ago