fudan-generative-vision / WAM-DiffLinks
WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
β166Updated last week
Alternatives and similar repositories for WAM-Diff
Users that are interested in WAM-Diff are comparing it to the libraries listed below
Sorting:
- β93Updated 7 months ago
- π Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systemsβ135Updated last week
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Drivingβ171Updated last week
- DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Drivingβ151Updated last month
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"β199Updated 3 years ago
- π WorldLens: Full-Spectrum Evaluations of Driving World Models in Real Worldβ178Updated 3 weeks ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β270Updated 3 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ215Updated 3 months ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!β136Updated 4 months ago
- hybrid sfm with VIO Pose,RGB and depth dataβ52Updated 2 years ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Executionβ356Updated last month
- β321Updated 3 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ482Updated 3 weeks ago
- A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.β140Updated last month
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β380Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understandingβ350Updated last month
- β246Updated last year
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphyβ188Updated last week
- β210Updated last week
- The Collapse of Patchesβ58Updated 2 months ago
- β19Updated 9 months ago
- β389Updated 6 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Predictionβ130Updated 4 months ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory Systemβ100Updated 6 months ago
- β15Updated last year
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning wβ¦β51Updated 11 months ago
- Llama from scratch in CUDA with Flash Attention.β43Updated 3 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"β66Updated last month
- DPO-Shift: Shifting the Distribution of Direct Preference Optimizationβ59Updated 11 months ago
- A lightweight, high-performance deep learning inference tool.β51Updated last month