Gaiejj / omniairlLinks
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated 2 years ago
Alternatives and similar repositories for omniairl
Users that are interested in omniairl are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆30Updated 8 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆345Updated 6 months ago
- ☆37Updated 3 months ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆62Updated 2 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆372Updated 7 months ago
- The homework of robos learning base.☆11Updated 2 years ago
- List of papers about Large Multimodal model☆28Updated last month
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆698Updated 2 weeks ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆14Updated last month
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆241Updated last year
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆59Updated 6 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆64Updated last month
- Visualizing the attention of vision-language models☆207Updated 4 months ago
- ☆341Updated last year
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆35Updated 3 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆31Updated 4 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆32Updated 11 months ago
- ☆129Updated 5 months ago
- ☆20Updated last month
- Paper List of Inference/Test Time Scaling/Computing☆280Updated 2 weeks ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆335Updated 4 months ago
- ☆48Updated 7 months ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆20Updated 4 months ago
- 抢占显卡☆71Updated 9 months ago
- Official implementation of the NeurIPS 2024 paper CORY☆17Updated 4 months ago
- Official github repo for SafeDialBench, a comprehensive multi-turn dialogue benchmark to evaluate LLMs' safety.☆33Updated 2 months ago
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning☆40Updated 7 months ago
- Collections of Papers and Projects for Multimodal Reasoning.☆105Updated 2 months ago
- Official repository for VisionZip (CVPR 2025)☆321Updated last month
- Documents used for grad school application☆302Updated 4 years ago