wl-zhao / THU-CoursesLinks
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users that are interested in THU-Courses are comparing it to the libraries listed below
Sorting:
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆117Updated 11 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆236Updated 2 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆77Updated 10 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆92Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆176Updated 2 weeks ago
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆114Updated 8 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆126Updated 11 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆183Updated 7 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆34Updated last year
- Documents used for grad school application☆302Updated 4 years ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆139Updated 3 months ago
- [CVPR'25] A vision question answering (VQA) benchmark for 6D spatial reasoning.☆10Updated 3 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆40Updated 4 months ago
- Chat about anything on any video!☆36Updated 2 years ago
- ☆118Updated 2 years ago
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆312Updated 7 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆62Updated last year
- ☆31Updated 3 months ago
- ☆11Updated last month
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆148Updated last month
- Official Implementation of VideoDPO☆142Updated 3 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆76Updated last year
- Pytorch implementation for MeanFlow☆145Updated last month
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆140Updated last year
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆212Updated 3 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39Updated last year
- A list of works on video generation towards world model☆165Updated last month
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Updated 7 months ago