[ICML 2025] Implementation of Spatial Reasoning with Denoising Models
☆85Jul 18, 2025Updated 7 months ago
Alternatives and similar repositories for SRM
Users that are interested in SRM are comparing it to the libraries listed below
Sorting:
- MEt3R: Measuring Multi-View Consistency in Generated Images☆161Feb 23, 2026Updated last week
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆31Dec 13, 2024Updated last year
- ☆10Nov 17, 2022Updated 3 years ago
- [CoRL 2024] Software and hardware instructions for SoniceSense.☆15Mar 1, 2025Updated last year
- 3D Scene Flow Estimation☆14Sep 24, 2025Updated 5 months ago
- ☆13Jul 17, 2024Updated last year
- Code for Principal Masked Autoencoders☆30Feb 4, 2026Updated 3 weeks ago
- ☆13Feb 2, 2023Updated 3 years ago
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆22Oct 22, 2025Updated 4 months ago
- Official implementation of the paper "STARS: Self-supervised 3D Action Recognition with Contrastive Tuning".☆13Jan 6, 2025Updated last year
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- Reviews of papers on ML, DL, Statistics, Optimization, etc.☆12Aug 2, 2021Updated 4 years ago
- DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion, ICCV 2023☆62Jun 21, 2024Updated last year
- ☆12May 5, 2024Updated last year
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆21Mar 23, 2025Updated 11 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆36Updated this week
- [ICLR2026] Spatial Reasoning with Vision-Language Models☆36Jan 26, 2026Updated last month
- Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding☆31Dec 23, 2025Updated 2 months ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 4 months ago
- Blender addon for vggt 3D reconstruction☆77Jun 29, 2025Updated 8 months ago
- [IROS2024] STAIR: Semantic-Targeted Active Implicit Reconstruction☆17Aug 3, 2024Updated last year
- [ECCV 2024] DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing☆125Jul 19, 2025Updated 7 months ago
- Official code for ECCV 2024 paper: Learn to Optimize Denoising Scores A Unified and Improved Diffusion Prior for 3D Generation☆72Jul 11, 2024Updated last year
- [ICRA'24] Influence of Camera-LiDAR Configuration on 3D Object Detection for Autonomous Driving☆21Sep 14, 2024Updated last year
- Evaluating Multiview Object Correspondence between Humans and Image models☆20Feb 12, 2025Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection☆21Oct 5, 2023Updated 2 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Mar 26, 2025Updated 11 months ago
- STeP: a general and scalable framework for solving video inverse problems with spatiotemporal diffusion priors☆29Jun 10, 2025Updated 8 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆31Jun 12, 2025Updated 8 months ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"☆29Jul 7, 2025Updated 7 months ago
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)☆21Oct 28, 2023Updated 2 years ago
- ☆23Feb 4, 2023Updated 3 years ago
- Random Mesh Projectors for Inverse Problems☆24Apr 13, 2021Updated 4 years ago
- 3D human pose and shape estimation pipeline for multi-person videos, built around the NLF pose and shape model (NeurIPS'24)☆31May 22, 2025Updated 9 months ago
- Official Implementation of Posterior Distillation Sampling☆93Jul 7, 2025Updated 7 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆85May 26, 2025Updated 9 months ago
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated 11 months ago
- [T-PAMI2025] Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy☆28Jan 13, 2025Updated last year