wufeim / DST3D
Official implementation of "Generating images with 3D annotations using diffusion models".
☆47Updated 8 months ago
Alternatives and similar repositories for DST3D:
Users that are interested in DST3D are comparing it to the libraries listed below
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated 10 months ago
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated 10 months ago
- ☆21Updated last year
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated 2 weeks ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆50Updated 2 months ago
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆34Updated last year
- ☆32Updated 2 years ago
- ☆45Updated 5 months ago
- ☆29Updated 2 years ago
- 【 ICLR 2025 】I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength☆108Updated last month
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 9 months ago
- Implementation of RSGC-BD (Blur Detection)☆47Updated 7 months ago
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆66Updated last year
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆111Updated 6 months ago
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆36Updated 9 months ago
- ☆24Updated this week
- ☆58Updated 11 months ago
- ☆80Updated 5 months ago
- ☆62Updated 2 years ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated last month
- HiWilliamWWL / Learn-to-Predict-How-Humans-Manipulate-Large-Sized-Objects-From-Interactive-Motions-objectsThis is the repo for the paper "Learn to Predict How Humans Manipulate Large-Sized Objects From Interactive Motions"☆20Updated last year
- ☆43Updated last year
- DeDA: Differentiable Image Integration Library☆19Updated last year
- Panorama Generation as a Next-Token Prediction Task.☆19Updated 3 weeks ago
- deep 3d reconstruction☆25Updated last year
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆53Updated last month
- My open-source C++ software for 3D coral rugosity computation (made when I was in HKUST).☆34Updated last year
- [AAAI 2021] VMLoc: Variational Fusion For Learning-Based Multimodal Camera Localization☆30Updated 5 months ago
- MMDepth: Comprehensive MMEngine-based Framework for Monocular, Stereo & Multi-view Depth Estimation☆99Updated last month
- Diffusion-Driven Self-Supervised Network for Multi-Object 3D Shape Reconstruction and Categorical 6-DoF Pose Estimation☆27Updated last year