Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.
☆113Jan 14, 2026Updated 4 months ago
Alternatives and similar repositories for Dream-VLX
Users that are interested in Dream-VLX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Jul 9, 2025Updated 10 months ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- ☆28Jul 23, 2025Updated 10 months ago
- ☆26Aug 23, 2024Updated last year
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of the paper 'Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance' (EMNLP 2025)☆30Dec 16, 2025Updated 5 months ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆75Mar 6, 2026Updated 2 months ago
- [ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning☆61Apr 7, 2026Updated last month
- ☆19Nov 4, 2024Updated last year
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction☆25Dec 6, 2024Updated last year
- ☆33Jun 24, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 8 months ago
- Official Implementation of SAGE-GRPO:Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆118Apr 2, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR2026] Spatial Reasoning with Vision-Language Models☆53Jan 26, 2026Updated 4 months ago
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- ☆68May 2, 2026Updated 3 weeks ago
- nav2gpt: navigation based on llm and ros2☆18Jul 18, 2024Updated last year
- [ICML 2024] Self-Infilling Code Generation☆18May 5, 2024Updated 2 years ago
- Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)☆23May 12, 2023Updated 3 years ago
- [SIGGRAPH 2026] OmniRoam: World Wandering via Long-Horizon Panoramic Video Generation☆96Apr 8, 2026Updated last month
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆152Aug 26, 2024Updated last year
- Dynamic config system based on python classes☆12Jan 27, 2023Updated 3 years ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆22Dec 21, 2025Updated 5 months ago
- Official code repository of Shuffle-R1☆26Feb 23, 2026Updated 3 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆61Dec 16, 2025Updated 5 months ago
- ☆50Jan 28, 2025Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆82Nov 25, 2024Updated last year
- A collection of resources and information for concrete skills that are helpful when pursuing a PhD in computer science (specifically in M…☆23Apr 18, 2023Updated 3 years ago
- Official github repo of G-LLaVA☆149Feb 20, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- ☆58Jan 31, 2026Updated 3 months ago
- ☆28Feb 26, 2023Updated 3 years ago
- The official implementation of dLLM-Var☆34Nov 6, 2025Updated 6 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆90Jan 16, 2026Updated 4 months ago
- Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]☆191Mar 30, 2026Updated last month
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆26Aug 24, 2025Updated 9 months ago