Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.
☆113Jan 14, 2026Updated 2 months ago
Alternatives and similar repositories for Dream-VLX
Users that are interested in Dream-VLX are comparing it to the libraries listed below.
- ☆15Jul 9, 2025Updated 8 months ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Towards Generalizable Robotic Manipulation in Dynamic Environments☆34Mar 17, 2026Updated last week
- ☆21May 24, 2024Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- ☆27Jul 23, 2025Updated 8 months ago
- ☆25Aug 23, 2024Updated last year
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated last year
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆51Mar 6, 2026Updated 3 weeks ago
- OmniGAIA: Towards Native Omni-Modal AI Agents☆82Mar 16, 2026Updated last week
- ☆18Nov 4, 2024Updated last year
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction☆23Dec 6, 2024Updated last year
- ☆33Jun 24, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- ☆10Mar 13, 2023Updated 3 years ago
- ☆59Nov 12, 2025Updated 4 months ago
- [ICML 2024] Self-Infilling Code Generation☆18May 5, 2024Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆150Aug 26, 2024Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders [Technical Report]☆159Updated this week
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated last month
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆88Jan 16, 2026Updated 2 months ago
- A collection of resources and information for concrete skills that are helpful when pursuing a PhD in computer science (specifically in M…☆24Apr 18, 2023Updated 2 years ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 3 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆79Nov 25, 2024Updated last year
- ☆18Mar 20, 2022Updated 4 years ago
- Official github repo of G-LLaVA☆148Feb 20, 2025Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- ☆27Feb 26, 2023Updated 3 years ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 4 months ago
- Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies☆61Dec 3, 2025Updated 3 months ago
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆47Mar 20, 2026Updated last week
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆71Feb 18, 2026Updated last month
- ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation☆25Aug 24, 2025Updated 7 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 3 years ago
- ☆43Updated this week
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]☆185Mar 12, 2026Updated 2 weeks ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated 2 months ago