Stability-AI's SV3D (ECCV 2024 oral, Voleti et al.) in the diffusers convention.
☆31Feb 5, 2025Updated last year
Alternatives and similar repositories for sv3d-diffusers
Users who are interested in sv3d-diffusers are comparing it to the repositories listed below
- ☆13Sep 2, 2023Updated 2 years ago
- This repository contains the code for the IEEE Robotics and Automation Letters paper "Open-Set Object Detection Using Classification-Free…☆14Dec 6, 2023Updated 2 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- ☆14May 4, 2025Updated 9 months ago
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild☆39Feb 21, 2026Updated last week
- Official JAX implementation of neural isometries - taming transformations for equivariant ML☆36Aug 1, 2025Updated 7 months ago
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- Official PyTorch Implementation of Opt-CWM: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals.☆22Mar 27, 2025Updated 11 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆67Feb 13, 2026Updated 2 weeks ago
- ☆21Feb 27, 2024Updated 2 years ago
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆44Oct 15, 2025Updated 4 months ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Jul 27, 2025Updated 7 months ago
- ☆22Sep 26, 2024Updated last year
- [NeurIPS'24] Official implementation of "HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors"☆159Updated this week
- These scripts are used to download RealEstate10K dataset.☆98Mar 22, 2024Updated last year
- ☆38Jan 8, 2026Updated last month
- An implementation of 'simple diffusion: End-to-end diffusion for high resolution images' as published by Hoogeboom et al.☆37Feb 9, 2025Updated last year
- ☆37May 23, 2025Updated 9 months ago
- Fine-tuning code for SV3D☆113Sep 9, 2024Updated last year
- VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model☆181May 8, 2024Updated last year
- ☆32Dec 20, 2023Updated 2 years ago
- Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image☆305Jun 2, 2025Updated 8 months ago
- Code for Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning☆36Jun 16, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- [ToG 2024]: DMHomo: Learning Homography with Diffusion Models☆30Oct 19, 2024Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆39Aug 23, 2025Updated 6 months ago
- Train your own object detection model with ATSS!! Super-detailed tutorial and PDF tutorial download!!!☆10Jul 28, 2020Updated 5 years ago
- ☆51Aug 22, 2025Updated 6 months ago
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆49Jul 28, 2025Updated 7 months ago
- [3DV 2024] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization☆33Mar 17, 2025Updated 11 months ago
- A niche toolkit for 3D computer vision tasks.☆319Feb 3, 2026Updated 3 weeks ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆49Jun 17, 2025Updated 8 months ago
- [NeurIPS 2024] GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling☆424Dec 10, 2024Updated last year
- Marigold adapted for video estimation☆30Mar 30, 2024Updated last year
- 🔥Benchmarking Unsupervised Obj Seg (NeurIPS 2022 & IJCV 2024)☆36Aug 26, 2025Updated 6 months ago
- Collection, with descriptions, of super-resolution-related papers, repositories, datasets, loss functions, etc.☆11Dec 12, 2023Updated 2 years ago
- [CVPR 2024] G3DR: Generative 3D Reconstruction in ImageNet☆38Jun 27, 2024Updated last year
- ☆32Feb 19, 2025Updated last year
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆41Sep 15, 2025Updated 5 months ago