Gaiejj / omniairl
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated last year
Alternatives and similar repositories for omniairl:
Users that are interested in omniairl are comparing it to the libraries listed below
- ☆97Updated last month
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Updated 10 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆28Updated 4 months ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆78Updated 2 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆202Updated 3 weeks ago
- 关于LLM和Multimodal LLM的paper list☆23Updated 3 weeks ago
- The paper collections for the autoregressive models in vision.☆368Updated this week
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆229Updated last year
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆78Updated 4 months ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin…☆67Updated 2 months ago
- ☆40Updated last month
- Official Repository of Multi-Object Hallucination in Vision-Language Models (NeurIPS 2024)☆26Updated 2 months ago
- [CVPR2024] This is the official implement of MP5☆92Updated 6 months ago
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆77Updated 6 months ago
- Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"☆219Updated 3 weeks ago
- ☆36Updated 2 weeks ago
- A tiny paper rating web☆27Updated this week
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆249Updated last month
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆136Updated this week
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆11Updated 3 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆105Updated 8 months ago
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆184Updated last month
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆229Updated 3 months ago
- Official python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"☆13Updated 6 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆17Updated this week
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆96Updated 2 weeks ago
- ☆65Updated last month
- A Self-Training Framework for Vision-Language Reasoning☆60Updated 2 months ago
- ☆88Updated 2 years ago
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆13Updated 3 weeks ago