[ICML 2025] Implementation of Spatial Reasoning with Denoising Models
☆85Jul 18, 2025Updated 9 months ago
Alternatives and similar repositories for SRM
Users that are interested in SRM are comparing it to the libraries listed below.
- [CODEML @ ICML2025] Official implementation of the 🌀Spatial Reasoners python package☆39Jan 5, 2026Updated 3 months ago
- Official Implementation of "Interpretable 3D Neural Object Volumes for Robust Conceptual Reasoning." ICLR 2026.☆30Feb 3, 2026Updated 3 months ago
- [CVPR '25] MEt3R: Measuring Multi-View Consistency in Generated Images☆170Feb 23, 2026Updated 2 months ago
- [3DV '25] Official repository of the paper "Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes".☆32Dec 2, 2024Updated last year
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆31Dec 13, 2024Updated last year
- An open source Multi-View Latent Diffusion Model☆44Feb 23, 2026Updated 2 months ago
- ☆12May 9, 2023Updated 2 years ago
- Reviews of papers on ML, DL, Statistics, Optimization, etc.☆12Aug 2, 2021Updated 4 years ago
- ☆13Feb 2, 2023Updated 3 years ago
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆30Mar 18, 2026Updated last month
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 10 months ago
- [ISPRS 2026] The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization☆38Dec 11, 2025Updated 4 months ago
- ☆14Jul 17, 2024Updated last year
- The official repository for DreamSampler (ECCV24)☆37Oct 11, 2024Updated last year
- ICCV 2023: Iterative Superquadric Recomposition of 3D Objects from Multiple Views☆16Oct 5, 2023Updated 2 years ago
- ☆44May 9, 2025Updated 11 months ago
- DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion, ICCV 2023☆64Jun 21, 2024Updated last year
- [NeurIPS 2024] Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆36Oct 23, 2024Updated last year
- ☆21Jan 23, 2026Updated 3 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆180Mar 18, 2026Updated last month
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 10 months ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Evaluating Multiview Object Correspondence between Humans and Image models☆20Feb 12, 2025Updated last year
- ☆21Jan 19, 2026Updated 3 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- Simple Tensorflow implementation of "SDIT: Scalable and Diverse Cross-domain Image Translation" (ACM-MM 2019)☆16Oct 14, 2019Updated 6 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 9 months ago
- Official Implementation of Posterior Distillation Sampling☆93Jul 7, 2025Updated 9 months ago
- PyTorch Implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- [CoRL 2024] Software and hardware instructions for SonicSense.☆15Mar 1, 2025Updated last year
- Video Diffusion State Space Models☆19Mar 27, 2024Updated 2 years ago
- Official code for ECCV 2024 paper: Learn to Optimize Denoising Scores A Unified and Improved Diffusion Prior for 3D Generation☆72Jul 11, 2024Updated last year
- STeP: a general and scalable framework for solving video inverse problems with spatiotemporal diffusion priors☆31Jun 10, 2025Updated 10 months ago
- Official implementation of "Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation" (NeurIPS 2025)☆20Apr 2, 2026Updated last month
- [ECCV 2024] DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing☆130Jul 19, 2025Updated 9 months ago
- ConDense backbone, weights, and evaluation code.☆30Jun 27, 2024Updated last year
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆22Mar 23, 2025Updated last year
- [SIGGRAPH 2025] 3D Stylization via Large Reconstruction Model☆32Oct 14, 2025Updated 6 months ago
- Spatial Aptitude Training for Multimodal Language Models☆31Feb 8, 2026Updated 2 months ago