Gaiejj / omniairl
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for omniairl
- Official python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"☆12Updated 4 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆135Updated last month
- [NIPS24W]This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated…☆73Updated 4 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆75Updated 2 months ago
- ☆16Updated 7 months ago
- The paper collections for the autoregressive models in vision.☆229Updated this week
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆225Updated 10 months ago
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way☆18Updated last month
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆26Updated 3 months ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆71Updated 3 weeks ago
- ☆83Updated 2 years ago
- The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models☆10Updated last month
- ☆61Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆204Updated last month
- ☆75Updated 3 weeks ago
- [CVPR2024] This is the official implement of MP5☆84Updated 4 months ago
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective.☆16Updated last month
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆31Updated 6 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆9Updated last month
- Accepted by CVPR 2024☆28Updated 6 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆96Updated last week
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆23Updated 4 months ago
- Visualizing the attention of vision-language models☆72Updated 3 weeks ago
- A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!☆118Updated 10 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆37Updated 5 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).☆16Updated 7 months ago
- [NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations☆121Updated 7 months ago
- Paper collections of the continuous effort start from World Models.☆140Updated 4 months ago
- This is a repository for listing papers on scene graph generation and application.☆78Updated this week
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆70Updated 2 weeks ago