wl-zhao / THU-Courses
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users who are interested in THU-Courses are comparing it to the repositories listed below
- ElasticTok: Adaptive Tokenization for Image and Video☆83Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆134Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆271Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆186Updated 10 months ago
- A survey for visual generation alignment☆98Updated 3 weeks ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆93Updated last year
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆143Updated 6 months ago
- Official Implementation of VideoDPO☆147Updated 5 months ago
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆321Updated 10 months ago
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆124Updated 10 months ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆135Updated 3 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆62Updated last year
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆33Updated last year
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆33Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?☆138Updated 9 months ago
- An in-context conditioning version of MUSE with pre-trained checkpoints.☆116Updated 2 years ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆77Updated 2 years ago
- ☆36Updated 6 months ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated 2 years ago
- Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆96Updated 9 months ago
- ☆60Updated 3 months ago
- ☆118Updated 2 years ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆136Updated last year
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆180Updated last year
- ☆11Updated 4 months ago
- ICCV2023-Diffusion-Papers☆108Updated 2 years ago
- Sample LaTeX file for HKU PhD thesis.☆26Updated 3 years ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆209Updated this week
- ☆184Updated 11 months ago
- Documents used for grad school application☆307Updated 4 years ago