nv-tlabs / cosmos-av-sample-toolkits
Cosmos-Transfer1-7B-Sample-AV Toolkits
☆36 Updated last month
Alternatives and similar repositories for cosmos-av-sample-toolkits
Users who are interested in cosmos-av-sample-toolkits are comparing it to the repositories listed below.
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models ☆50 Updated 2 weeks ago
- ☆88 Updated 6 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆83 Updated 7 months ago
- [RA-L 2024] DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction ☆80 Updated 9 months ago
- ☆47 Updated last month
- Project Page for GaussianFormer ☆25 Updated last year
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025) ☆76 Updated this week
- [ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation ☆66 Updated 2 weeks ago
- ☆101 Updated 7 months ago
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving ☆30 Updated 7 months ago
- [arXiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization ☆26 Updated this week
- Official implementation of VGGT-Long ☆110 Updated 2 weeks ago
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features ☆31 Updated 7 months ago
- Amodal Depth Anything: Amodal Depth Estimation in the Wild ☆31 Updated 6 months ago
- ☆53 Updated last year
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral) ☆125 Updated 3 months ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark" ☆15 Updated 4 months ago
- Official Implementation of Driv3R ☆90 Updated 7 months ago
- Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models ☆164 Updated last week
- ☆20 Updated 3 months ago
- Code for Streaming 4D Visual Geometry Transformer ☆163 Updated this week
- Official PyTorch implementation of UrbanGIRAFFE (ICCV 2023) ☆58 Updated last year
- ConDense backbone, weights, and evaluation code. ☆32 Updated last year
- ☆51 Updated 11 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction ☆214 Updated 2 weeks ago
- [AAAI25 Oral] DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation ☆40 Updated 2 months ago
- CVPR 2025: VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction ☆38 Updated last week
- Official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction" ☆72 Updated 3 months ago
- [NeurIPS 2023] 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection ☆51 Updated last year
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning ☆79 Updated last year