Gaiejj / omniairlLinks

A trustworthy benchmark for IAIR Reinforcement Learning homework

☆9

Alternatives and similar repositories for omniairl

Users that are interested in omniairl are comparing it to the libraries listed below

Sorting:

kid-yang233 / robots
The homework of robos learning base.
☆11Updated 2 years ago
deepcs233 / Visual-CoT
[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …
☆360Updated 7 months ago
PKU-Alignment / safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…
☆32Updated 11 months ago
jungao1106 / ICoT
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆70Updated 3 months ago
microsoft / visualization-of-thought
[NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.
☆33Updated 9 months ago
yanghlll / ScalingNoise
☆38Updated 4 months ago
QuenithAI / aaai-26-reproduction-checklist
An example reproduction checklist for AAAI-26 submissions.
☆106Updated last week
zhyang2226 / OPA-DPO
[CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
☆69Updated 2 months ago
ivattyue / SC-Tune
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16Updated last year
wutianyuan1 / GPU-preempter
☆18Updated 9 months ago
ziqipang / LM4VisualEncoding
[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"
☆241Updated last year
RL4VLM / RL4VLM
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
☆378Updated 7 months ago
Hui-design / R1-Video-fixbug
[Blog 1] Recording a bug of grpo_trainer in some R1 projects
☆20Updated 5 months ago
mrwu-mac / ControlMLLM
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆186Updated 3 weeks ago
Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List
🔥CVPR 2025 Multimodal Large Language Models Paper List
☆149Updated 4 months ago
tsb0601 / MMVP
☆344Updated last year
NOVAglow646 / LLM-MLLM-paper-list
关于LLM和Multimodal LLM的paper list
☆42Updated last month
AMAP-ML / NarrLV
NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models
☆106Updated 2 weeks ago
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆805Updated 3 weeks ago
Chenyu-Wang567 / MLLM-Tool
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
☆130Updated last year
HKUST-LongGroup / Awesome-MLLM-Benchmarks
☆134Updated 5 months ago
minglllli / CLS-RL
Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning
☆58Updated 2 months ago
WayneJin0918 / SOTA-paper-rating.io
A tiny paper rating web
☆39Updated 4 months ago
chengzu-li / MVoT
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)
☆37Updated 3 months ago
Gumpest / SparseVLMs
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆138Updated 2 months ago
dvlab-research / VisionZip
Official repository for VisionZip (CVPR 2025)
☆329Updated 3 weeks ago
Purshow / Awesome-Unified-Multimodal
📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.
☆268Updated last week
OpenDCAI / Awesome_MLLMs_Reasoning
☆103Updated last month
Atomic-man007 / Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…
☆339Updated 4 months ago
Trent-Fellbootman / dev000
A complete introductory course to programming, computer systems and software development (continuously updating).
☆12Updated last year