jacklishufan / Reflect-DiT
Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection
☆55 · Aug 16, 2025 · Updated 5 months ago
Alternatives and similar repositories for Reflect-DiT
Users who are interested in Reflect-DiT are comparing it to the libraries listed below.
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing ☆72 · Oct 12, 2025 · Updated 4 months ago
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation ☆25 · Aug 5, 2025 · Updated 6 months ago
- Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning" ☆19 · Nov 19, 2024 · Updated last year
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation ☆20 · Feb 23, 2025 · Updated 11 months ago
- Code of StyleCrafter on SDXL ☆20 · Jun 25, 2024 · Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1" ☆31 · Nov 15, 2025 · Updated 2 months ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning ☆234 · Jan 22, 2026 · Updated 3 weeks ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers ☆40 · Jul 23, 2025 · Updated 6 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval ☆22 · Jun 23, 2025 · Updated 7 months ago
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation ☆85 · Mar 16, 2025 · Updated 10 months ago
- The program used to occupy GPUs. ☆10 · Mar 24, 2023 · Updated 2 years ago
- CVPR 2025 Accepted Papers ☆23 · Dec 20, 2025 · Updated last month
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2… ☆15 · Feb 2, 2026 · Updated last week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing" ☆307 · Sep 28, 2025 · Updated 4 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation ☆24 · Oct 6, 2024 · Updated last year
- Video Diffusion Transformers are In-Context Learners ☆36 · Jan 6, 2025 · Updated last year
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆130 · May 16, 2025 · Updated 8 months ago
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl… ☆21 · Aug 5, 2025 · Updated 6 months ago
- ☆13 · Jul 10, 2024 · Updated last year
- Orienting Latent Actions for Video World Modeling ☆48 · Updated this week
- ☆13 · Jan 22, 2025 · Updated last year
- a collection of awesome autoregressive visual generation models ☆79 · Apr 17, 2025 · Updated 9 months ago
- GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities ☆305 · May 3, 2025 · Updated 9 months ago
- ☆16 · May 13, 2025 · Updated 9 months ago
- CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models ☆10 · Feb 3, 2026 · Updated last week
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆163 · Oct 21, 2025 · Updated 3 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆13 · Mar 17, 2025 · Updated 10 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆625 · Oct 29, 2025 · Updated 3 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ☆51 · Jun 17, 2025 · Updated 7 months ago
- [ICCV 2025] MagicMirror: ID-Preserved Video Generation in Video Diffusion Transformers ☆128 · Jun 26, 2025 · Updated 7 months ago
- ☆189 · Dec 17, 2024 · Updated last year
- [ICCV 2025] Prompt-A-Video ☆20 · Feb 2, 2025 · Updated last year
- On Path to Multimodal Generalist: General-Level and General-Bench ☆18 · Jul 11, 2025 · Updated 7 months ago
- ☆18 · Mar 21, 2025 · Updated 10 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆69 · May 18, 2025 · Updated 8 months ago
- ☆62 · Jun 25, 2024 · Updated last year
- ☆21 · Jul 25, 2025 · Updated 6 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆310 · Jan 31, 2025 · Updated last year
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆320 · Mar 30, 2025 · Updated 10 months ago