Gaiejj / omniairl
A trustworthy benchmark for IAIR Reinforcement Learning homework
☆9Updated 2 years ago
Alternatives and similar repositories for omniairl:
Users that are interested in omniairl are comparing it to the libraries listed below
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆30Updated 7 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆27Updated last month
- ☆19Updated 2 years ago
- ☆107Updated last month
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆83Updated 2 months ago
- Video-R1: Towards Super Reasoning Ability in Video Understanding MLLMs☆105Updated last month
- [ICLR 2025] SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training☆16Updated 2 weeks ago
- Official repository for VisionZip (CVPR 2025)☆259Updated last month
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 8 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆54Updated 2 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆11Updated 5 months ago
- [ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"☆64Updated 2 weeks ago
- ☆50Updated last week
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆95Updated 5 months ago
- 🔥CVPR 2025 Multimodal Large Language Models Paper List☆119Updated 2 weeks ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆278Updated 3 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆53Updated last week
- [CVPR2024] This is the official implement of MP5☆99Updated 8 months ago
- [CVPR 2025] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆40Updated 3 weeks ago
- Awesome RL-based LLM Reasoning☆341Updated last week
- SOTA RL fine-tuning solution for advanced math reasoning of LLM☆91Updated this week
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"☆138Updated 3 weeks ago
- Survey on Data-centric Large Language Models☆81Updated 8 months ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".☆80Updated 3 weeks ago
- The homework of robos learning base.☆10Updated last year
- 关于LLM和Multimodal LLM的paper list☆31Updated this week
- Code for our ICML'24 on multimodal dataset distillation☆36Updated 5 months ago
- ☆16Updated 11 months ago
- ☆69Updated 3 months ago
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆49Updated last week