Gaiejj / omniairl
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated 2 years ago
Alternatives and similar repositories for omniairl
Users that are interested in omniairl are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆51Updated last month
- ☆117Updated 3 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆40Updated 3 weeks ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆314Updated 4 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆11Updated 7 months ago
- The homework of robos learning base.☆11Updated last year
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆20Updated 2 months ago
- ⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆136Updated last week
- ☆24Updated 3 months ago
- [CVPR2024] This is the official implement of MP5☆101Updated 10 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆31Updated 8 months ago
- 关于LLM和Multimodal LLM的paper list☆38Updated last week
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆84Updated 8 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆142Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning☆78Updated 3 months ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆76Updated last week
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆112Updated last week
- Collections of Papers and Projects for Multimodal Reasoning.☆104Updated 3 weeks ago
- A python script for downloading huggingface datasets and models.☆19Updated last month
- ☆36Updated last month
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆89Updated 3 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆104Updated 6 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆122Updated last year
- VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models☆57Updated 10 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆188Updated this week
- ☆95Updated last month
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆125Updated 10 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆73Updated 11 months ago
- [NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆27Updated 6 months ago
- ☆46Updated 5 months ago