fudan-generative-vision / WAM-Diff
WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
☆166Updated last week
Alternatives and similar repositories for WAM-Diff
Users who are interested in WAM-Diff are comparing it to the repositories listed below.
- ☆93Updated 7 months ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆171Updated last week
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆135Updated last week
- DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving☆150Updated last month
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆270Updated 3 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆380Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 3 months ago
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆178Updated 3 weeks ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆482Updated 3 weeks ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- Hybrid SfM with VIO pose, RGB, and depth data☆52Updated 2 years ago
- ☆321Updated 3 months ago
- A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.☆140Updated last month
- ☆246Updated last year
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆186Updated last week
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆130Updated 4 months ago
- [EMNLP 2025] Official implementation: Agent-style visual question answering in Autonomous Driving!☆136Updated 4 months ago
- The Collapse of Patches☆58Updated 2 months ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆51Updated 11 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆59Updated 11 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated last month
- ☆389Updated 6 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆1,003Updated 2 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆574Updated last month
- ☆210Updated last week
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 5 months ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System☆99Updated 5 months ago
- 🔥 The first open-sourced diffusion vision-language-action model.☆160Updated last month