Gaiejj / omniairlLinks
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated 2 years ago
Alternatives and similar repositories for omniairl
Users that are interested in omniairl are comparing it to the libraries listed below
Sorting:
- The homework of robos learning base.☆11Updated 2 years ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆360Updated 7 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆32Updated 11 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆70Updated 3 months ago
- [NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆33Updated 9 months ago
- ☆38Updated 4 months ago
- An example reproduction checklist for AAAI-26 submissions.☆106Updated last week
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆69Updated 2 months ago
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Updated last year
- ☆18Updated 9 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆241Updated last year
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆378Updated 7 months ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆20Updated 5 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆186Updated 3 weeks ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆149Updated 4 months ago
- ☆344Updated last year
- 关于LLM和Multimodal LLM的paper list☆42Updated last month
- NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models☆106Updated 2 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆805Updated 3 weeks ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆130Updated last year
- ☆134Updated 5 months ago
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆58Updated 2 months ago
- A tiny paper rating web☆39Updated 4 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆37Updated 3 months ago
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆138Updated 2 months ago
- Official repository for VisionZip (CVPR 2025)☆329Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆268Updated last week
- ☆103Updated last month
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆339Updated 4 months ago
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Updated last year