jiawn-creator / Dynamic-DiTView external linksLinks
☆18Mar 21, 2025Updated 10 months ago
Alternatives and similar repositories for Dynamic-DiT
Users that are interested in Dynamic-DiT are comparing it to the libraries listed below
Sorting:
- Video Diffusion Transformers are In-Context Learners☆36Jan 6, 2025Updated last year
- ☆21Jun 3, 2023Updated 2 years ago
- Mixture-of-Groups Attention for End-to-End Long Video Generation☆92Oct 22, 2025Updated 3 months ago
- 中科大跨模态智能组-每周论文分享☆16Nov 20, 2022Updated 3 years ago
- Pytorch implementation for Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation.☆18Jan 4, 2022Updated 4 years ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆36Jun 21, 2025Updated 7 months ago
- LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation☆38Mar 3, 2025Updated 11 months ago
- “SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity” by Peihao Wang, Zhiwen Fan, Dejia Xu, Dilin Wang,…☆35Jan 5, 2024Updated 2 years ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆44Sep 30, 2024Updated last year
- ☆30Jan 23, 2026Updated 3 weeks ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆86Feb 11, 2023Updated 3 years ago
- ☆100Nov 6, 2025Updated 3 months ago
- Official PyTorch Implementation of SinGRAF (CVPR2023)☆11Jun 28, 2023Updated 2 years ago
- Official implementation of "Imaginarium: Vision-guided High-quality 3D Scene Layout Generation"☆41Dec 30, 2025Updated last month
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Aug 16, 2025Updated 5 months ago
- ☆10Nov 9, 2023Updated 2 years ago
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 6 months ago
- [ICLR 2026] DiMeR: Disentangled Mesh Reconstruction Model with Normal-only Geometry Training☆51May 26, 2025Updated 8 months ago
- DreamStyle: A Unified Framework for Video Stylization☆110Jan 7, 2026Updated last month
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated 11 months ago
- blender scripts for shapenet☆11Oct 12, 2020Updated 5 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 5 years ago
- [ICLR2025] Are Large Vision Language Models Good Game Players?☆13Mar 3, 2025Updated 11 months ago
- The DeepRacer Sensor Fusion ROS package creates the sensor_fusion_node that is responsible for collecting the messages from all the senso…☆12Oct 28, 2022Updated 3 years ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 7 months ago
- ☆11Nov 14, 2022Updated 3 years ago
- Self Evolving Large Multimodal Models with Continuous Rewards☆19Nov 21, 2025Updated 2 months ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi…☆28Jan 14, 2026Updated 3 weeks ago
- ☆16Sep 1, 2025Updated 5 months ago
- [ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.☆152Jan 27, 2026Updated 2 weeks ago
- ICML2025☆63Aug 28, 2025Updated 5 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆191Jul 23, 2023Updated 2 years ago
- [ECCV 2022 oral] OpenLane: Large-scale Realistic 3D Lane Dataset. Redirect to https://github.com/OpenDriveLab/OpenLane.☆11Feb 7, 2023Updated 3 years ago
- A grab-bag of utilities for messing around with filesystems and 3D models☆13Nov 18, 2024Updated last year
- ☆22Jun 5, 2025Updated 8 months ago
- ☆25Sep 19, 2025Updated 4 months ago