GeekGuru123 / ProfilingDiT
☆11Updated 3 weeks ago
Alternatives and similar repositories for ProfilingDiT:
Users that are interested in ProfilingDiT are comparing it to the libraries listed below
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆37Updated last month
- Official Implementation of VideoDPO☆96Updated this week
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆40Updated 2 weeks ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆48Updated this week
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆69Updated 2 weeks ago
- ☆26Updated 9 months ago
- ☆39Updated last year
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆73Updated 2 weeks ago
- Official implementation of MTM☆21Updated last year
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention☆34Updated last week
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Updated last year
- ☆33Updated 6 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆30Updated last month
- ☆26Updated last month
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆20Updated 9 months ago
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆28Updated last year
- The official repository of DreamMover☆31Updated 7 months ago
- Frequency Autoregressive Image Generation with Continuous Tokens☆56Updated last month
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- ☆80Updated 11 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆39Updated 3 weeks ago
- [ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation☆52Updated last month
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis☆36Updated last month
- open-sourced video dataset with dynamic scenes and camera movements annotation☆50Updated this week
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 7 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆18Updated 2 weeks ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆108Updated last week
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆66Updated 2 months ago
- Official implementation of Aurora☆82Updated last year
- Official Implementation for "ReMOVE: A Reference-free Metric for Object Erasure"☆16Updated 11 months ago