YanqiDai / MMRole
(ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents
☆89 Updated 10 months ago
Alternatives and similar repositories for MMRole
Users that are interested in MMRole are comparing it to the libraries listed below
- ☆59 Updated last year
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ☆35 Updated 8 months ago
- Test-time preference optimization (ICML 2025). ☆174 Updated 7 months ago
- ☆90 Updated last year
- Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning" ☆57 Updated last month
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆163 Updated 3 months ago
- ☆175 Updated 3 weeks ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆298 Updated 2 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆372 Updated 4 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆156 Updated 6 months ago
- ☆87 Updated last year
- ☆57 Updated 5 months ago
- ☆161 Updated 11 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆157 Updated 7 months ago
- The demo, code and data of FollowRAG ☆75 Updated 6 months ago
- Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents ☆208 Updated 7 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆88 Updated 11 months ago
- Extrapolating RLVR to General Domains without Verifiers ☆184 Updated 4 months ago
- ☆152 Updated 7 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆143 Updated last month
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆154 Updated last year
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆62 Updated last year
- ☆63 Updated 7 months ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN. ☆73 Updated 2 months ago
- ☆69 Updated 6 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆80 Updated last month
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆28 Updated 10 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆91 Updated last month
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources ☆211 Updated 3 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆53 Updated 9 months ago