wl-zhao / THU-CoursesLinks
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users that are interested in THU-Courses are comparing it to the libraries listed below
Sorting:
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆33Updated 3 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆107Updated 7 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆61Updated last year
- Accepted by CVPR 2024☆33Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆14Updated this week
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆110Updated this week
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆105Updated this week
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆87Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆34Updated 3 weeks ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆38Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation☆124Updated 4 months ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 11 months ago
- A list of works on video generation towards world model☆113Updated this week
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆24Updated 3 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆33Updated last year
- ☆16Updated last year
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆155Updated 2 months ago
- A Visualization Tool for GPU Occupancy on S Cluster.☆13Updated 2 years ago
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN☆33Updated this week
- DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. [ACMMM 2024] Official PyTorch implementation☆32Updated 8 months ago
- Official code for MotionBench (CVPR 2025)☆40Updated 3 months ago
- Idempotent Generative Network's unofficial pytorch implementation☆45Updated last year
- Documents used for grad school application☆302Updated 3 years ago
- ICLR2024 statistics☆47Updated last year
- Sample LaTex file for HKU PhD thesis.☆24Updated 3 years ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 3 months ago
- ☆44Updated 3 months ago
- An Examination of the Compositionality of Large Generative Vision-Language Models☆19Updated last year