☆20Jan 1, 2026Updated 2 months ago
Alternatives and similar repositories for ProfilingDiT
Users that are interested in ProfilingDiT are comparing it to the libraries listed below
Sorting:
- [Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…☆60Sep 22, 2025Updated 6 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 9 months ago
- [AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization☆30Jan 3, 2025Updated last year
- ☆53Dec 10, 2025Updated 3 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆131Apr 18, 2025Updated 11 months ago
- ☆37Feb 4, 2026Updated last month
- Papers and codes collection for customized, personalized and editable generative models☆28Oct 1, 2024Updated last year
- ☆54Mar 19, 2025Updated last year
- DiT for VAE (and Video Generation)☆35Sep 2, 2024Updated last year
- UniVid: The Open-Source Unified Video Model☆30Oct 13, 2025Updated 5 months ago
- The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation☆39May 4, 2025Updated 10 months ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation☆21Feb 23, 2025Updated last year
- ☆30Mar 4, 2025Updated last year
- Exploring Representation-Aligned Latent Space for Better Generation☆18Updated this week
- [AAAI26] Next Patch Prediction☆132Jan 2, 2025Updated last year
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆50Mar 15, 2026Updated last week
- Official Code of "Distribution Matching Distillation Meets Reinforcement Learning"☆199Feb 1, 2026Updated last month
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆82Dec 10, 2025Updated 3 months ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆62Jul 1, 2025Updated 8 months ago
- ☆46Mar 12, 2026Updated last week
- Pytorch implementation of Self-Refining Video Sampling☆152Feb 6, 2026Updated last month
- ☆22Nov 18, 2025Updated 4 months ago
- Explore how to get a VQ-VAE models efficiently!☆68Jul 24, 2025Updated 7 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆646Mar 6, 2026Updated 2 weeks ago
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆27Dec 11, 2025Updated 3 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆300Apr 23, 2025Updated 10 months ago
- [CVPR 2026] A training-free, mask-free framework for 3D shape editing.☆26Dec 12, 2025Updated 3 months ago
- [ AAAI26 ]: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation”☆17Mar 9, 2026Updated last week
- the official code of DriveMonkey☆45Updated this week
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆106Feb 25, 2026Updated 3 weeks ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆23Mar 15, 2026Updated last week
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 3 months ago
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆12Aug 26, 2025Updated 6 months ago
- Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework f…☆25Nov 4, 2025Updated 4 months ago
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- ☆54May 6, 2025Updated 10 months ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.☆29Jul 14, 2024Updated last year
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆121Dec 17, 2025Updated 3 months ago