Jialuo-Li / Science-T2I
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
☆23 · Updated this week
Alternatives and similar repositories for Science-T2I:
Users who are interested in Science-T2I compare it to the repositories listed below:
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆71 · Updated this week
- ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning ☆28 · Updated 2 weeks ago
- ☆19 · Updated this week
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography ☆42 · Updated last week
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025) ☆28 · Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆66 · Updated last month
- Official code repo for the CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation ☆21 · Updated last month
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆42 · Updated 2 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆90 · Updated this week
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way ☆37 · Updated this week
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos ☆22 · Updated this week
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆67 · Updated last week
- A collection of vision foundation models unifying understanding and generation. ☆50 · Updated 3 months ago
- ☆47 · Updated 4 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆79 · Updated last week
- ☆146 · Updated this week
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆89 · Updated last month
- ☆33 · Updated 6 months ago
- Open-source video dataset with annotations for dynamic scenes and camera movements ☆48 · Updated this week
- ☆43 · Updated this week
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention ☆34 · Updated this week
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆16 · Updated last month
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆107 · Updated last month
- ☆28 · Updated 4 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation ☆24 · Updated 6 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ☆36 · Updated last week
- FQGAN: Factorized Visual Tokenization and Generation ☆47 · Updated 3 weeks ago
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆35 · Updated last month
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆44 · Updated 3 weeks ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆69 · Updated last month