NVlabs / Long-RLLinks
Long-RL: Scaling RL to Long Sequences
☆592Updated this week
Alternatives and similar repositories for Long-RL
Users that are interested in Long-RL are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation☆382Updated 4 months ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆208Updated 4 months ago
- Official implementation of UnifiedReward & UnifiedReward-Think☆511Updated 3 weeks ago
- ☆208Updated last week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆276Updated 3 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆371Updated 2 weeks ago
- EVE Series: Encoder-Free Vision-Language Models from BAAI☆345Updated last month
- [Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey☆447Updated 7 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆397Updated 2 months ago
- A Unified Tokenizer for Visual Generation and Understanding☆388Updated 3 weeks ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆455Updated 8 months ago
- ☆488Updated last week
- The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning".☆134Updated last month
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆206Updated 2 months ago
- Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]☆667Updated 3 weeks ago
- Long Context Transfer from Language to Vision☆390Updated 5 months ago
- Visual Planning: Let's Think Only with Images☆268Updated 3 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆72Updated last month
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆312Updated 2 months ago
- Explore the Multimodal “Aha Moment” on 2B Model☆607Updated 5 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆138Updated 3 weeks ago
- Pixel-Level Reasoning Model trained with RL☆194Updated last month
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆294Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆130Updated 7 months ago
- A Collection of Papers on Diffusion Language Models☆111Updated this week
- Official implementation of the Law of Vision Representation in MLLMs☆163Updated 9 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆133Updated last month
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆670Updated 3 weeks ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆277Updated 2 weeks ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆219Updated last month