Towards Generalizable Robotic Manipulation in Dynamic Environments
☆34 · Mar 17, 2026 · Updated this week
Alternatives and similar repositories for DOMINO
Users who are interested in DOMINO are comparing it to the libraries listed below.
- ☆23 · Jun 5, 2025 · Updated 9 months ago
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation ☆238 · Jul 14, 2025 · Updated 8 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes ☆54 · Apr 9, 2025 · Updated 11 months ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners. ☆20 · Sep 24, 2025 · Updated 5 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection ☆13 · Apr 12, 2024 · Updated last year
- Streaming Thinking for VideoLLM Streaming Video Understanding ☆71 · Mar 13, 2026 · Updated last week
- ☆56 · Oct 3, 2024 · Updated last year
- Official code repository of Shuffle-R1 ☆25 · Feb 23, 2026 · Updated 3 weeks ago
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching ☆291 · Aug 29, 2025 · Updated 6 months ago
- The official code of DriveMonkey ☆44 · May 24, 2025 · Updated 9 months ago
- ☆41 · Feb 27, 2026 · Updated 3 weeks ago
- [ICRA 2026] UniFuture: A 4D Driving World Model for Future Generation and Perception ☆145 · Feb 26, 2026 · Updated 3 weeks ago
- Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA. ☆111 · Jan 14, 2026 · Updated 2 months ago
- Streaming Video Instruction Tuning ☆53 · Feb 25, 2026 · Updated 3 weeks ago
- Quantized training of Stable Diffusion 3 Medium to significantly reduce memory usage. ☆16 · Jul 10, 2024 · Updated last year
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆215 · Oct 31, 2025 · Updated 4 months ago
- The Chongqing University Bituminous Pavement Disease Detection Dataset (CQU-BPDD) ☆13 · Apr 17, 2025 · Updated 11 months ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos" ☆23 · Nov 23, 2025 · Updated 3 months ago
- [ICCV 2025] Official code of "ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation" ☆595 · Dec 10, 2025 · Updated 3 months ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026] ☆183 · Mar 12, 2026 · Updated last week
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆173 · Jul 7, 2025 · Updated 8 months ago
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti… ☆39 · Feb 24, 2026 · Updated 3 weeks ago
- [NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs. ☆92 · Sep 20, 2025 · Updated 6 months ago
- ☆14 · Sep 11, 2025 · Updated 6 months ago
- [CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer ☆92 · May 26, 2025 · Updated 9 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight ☆88 · Jan 16, 2026 · Updated 2 months ago
- ☆18 · Jan 8, 2026 · Updated 2 months ago
- ☆33 · Updated this week
- [AAAI 2025] MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation ☆37 · Jul 26, 2025 · Updated 7 months ago
- Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning” ☆182 · Feb 12, 2026 · Updated last month
- Official code repository of Shuffle-R1 ☆44 · Feb 23, 2026 · Updated 3 weeks ago
- Generate Gibson task dataset for objectnav ☆16 · Aug 27, 2020 · Updated 5 years ago
- ☆32 · Jan 30, 2026 · Updated last month
- [ICCV 2025] Improving 3D Large Language Model via Robust Instruction Tuning ☆69 · Oct 19, 2025 · Updated 5 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆354 · Dec 18, 2025 · Updated 3 months ago
- ☆12 · Mar 22, 2025 · Updated 11 months ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models ☆125 · Oct 14, 2025 · Updated 5 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆51 · Sep 21, 2024 · Updated last year
- Interface between mc_rtc and libfranka ☆12 · Mar 10, 2026 · Updated last week