☆20 · Jan 1, 2026 · Updated 2 months ago
Alternatives and similar repositories for ProfilingDiT
Users who are interested in ProfilingDiT are comparing it to the repositories listed below.
- [NeurIPS 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim… ☆59 · Sep 22, 2025 · Updated 5 months ago
- [AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization ☆30 · Jan 3, 2025 · Updated last year
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models ☆39 · Jun 14, 2025 · Updated 8 months ago
- ☆53 · Dec 10, 2025 · Updated 2 months ago
- Official PyTorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati… ☆12 · Aug 26, 2025 · Updated 6 months ago
- Papers and codes collection for customized, personalized and editable generative models ☆29 · Oct 1, 2024 · Updated last year
- DiT for VAE (and Video Generation) ☆35 · Sep 2, 2024 · Updated last year
- ☆32 · Feb 4, 2026 · Updated 3 weeks ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning" ☆16 · Jan 22, 2025 · Updated last year
- Exploring Representation-Aligned Latent Space for Better Generation ☆17 · Feb 4, 2025 · Updated last year
- [BMVC 2024] ControlDreamer enables high-quality 3D generation with creative geometry and styles via multi-view ControlNet. ☆17 · Sep 28, 2024 · Updated last year
- UniVid: The Open-Source Unified Video Model ☆30 · Oct 13, 2025 · Updated 4 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆130 · Apr 18, 2025 · Updated 10 months ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation ☆20 · Feb 23, 2025 · Updated last year
- Official code for VINCIE: Unlocking In-context Image Editing from Video ☆48 · Sep 8, 2025 · Updated 5 months ago
- Official code for the paper: Can3Tok (ICCV2025) ☆40 · Aug 29, 2025 · Updated 6 months ago
- ☆28 · Mar 4, 2025 · Updated 11 months ago
- Code for the paper "Invertible Neural BRDF for Object Inverse Rendering" ☆20 · Oct 15, 2020 · Updated 5 years ago
- Video Diffusion Transformers are In-Context Learners ☆35 · Jan 6, 2025 · Updated last year
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding. ☆29 · Jul 14, 2024 · Updated last year
- Project Page for GaussianFormer ☆24 · May 30, 2024 · Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction" ☆301 · Apr 23, 2025 · Updated 10 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆629 · Feb 3, 2026 · Updated 3 weeks ago
- Official implementation of “LucidFusion: Reconstructing 3D Gaussians with Arbitrary Unposed Images” ☆74 · Mar 21, 2025 · Updated 11 months ago
- The official code of DriveMonkey ☆43 · May 24, 2025 · Updated 9 months ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation ☆39 · May 4, 2025 · Updated 9 months ago
- Official Code of "Distribution Matching Distillation Meets Reinforcement Learning" ☆181 · Feb 1, 2026 · Updated last month
- [ICCV 2025] Amodal Depth Anything: Amodal Depth Estimation in the Wild ☆39 · Feb 21, 2026 · Updated last week
- [3DV 2024] Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization ☆33 · Mar 17, 2025 · Updated 11 months ago
- Official repository of IDEA-Bench ☆39 · Jan 24, 2025 · Updated last year
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer” ☆80 · Dec 10, 2025 · Updated 2 months ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models" ☆61 · Jul 1, 2025 · Updated 8 months ago
- ☆65 · Feb 23, 2026 · Updated last week
- [CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion ☆43 · Mar 21, 2025 · Updated 11 months ago
- A size profiler for CUDA binaries ☆72 · Jan 15, 2026 · Updated last month
- In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed. ☆37 · Feb 16, 2026 · Updated 2 weeks ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization ☆36 · Dec 9, 2025 · Updated 2 months ago
- Cosmos-Transfer1-7B-Sample-AV Toolkits ☆46 · Jun 11, 2025 · Updated 8 months ago
- ☆35 · Nov 5, 2024 · Updated last year