XZPKU / SA-HOI
☆9 Updated 10 months ago
Alternatives and similar repositories for SA-HOI:
Users who are interested in SA-HOI are comparing it to the libraries listed below
- ☆38 Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆42 Updated last week
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆25 Updated this week
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆29 Updated 4 months ago
- ☆14 Updated 3 weeks ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work… ☆25 Updated 8 months ago
- DDS: Delta Denoising Score PyTorch implementation ☆18 Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆66 Updated last month
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆14 Updated 2 weeks ago
- Code for paper Background Prompting for Improved Object Depth ☆29 Updated last year
- Video Diffusion State Space Models ☆19 Updated last year
- ☆21 Updated 9 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆47 Updated 5 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes". ☆12 Updated last year
- ☆19 Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" ☆44 Updated 3 months ago
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space ☆39 Updated 5 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 Updated 10 months ago
- ☆23 Updated last month
- Official implementation of MTM ☆21 Updated last year
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆36 Updated last month
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆29 Updated last month
- ☆10 Updated 8 months ago
- Codebase for the paper "Elucidating the design space of language models for image generation" ☆45 Updated 4 months ago
- [ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization ☆13 Updated 3 months ago
- ☆16 Updated last year
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation" ☆33 Updated 3 weeks ago
- [CVPR2025] Official PyTorch implementation of "Optical-Flow Guided Prompt Optimization for Coherent Video Generation (Motion Prompt)" ☆16 Updated 3 weeks ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 Updated last year
- ☆21 Updated 3 months ago