wl-zhao / THU-CoursesLinks
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users that are interested in THU-Courses are comparing it to the libraries listed below
Sorting:
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆127Updated 11 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆92Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆249Updated last week
- ElasticTok: Adaptive Tokenization for Image and Video☆80Updated 11 months ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆142Updated 4 months ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆135Updated last month
- Accepted by CVPR 2024☆39Updated last year
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆130Updated 5 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆184Updated 8 months ago
- ☆11Updated 2 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆41Updated 5 months ago
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆30Updated 6 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆33Updated last year
- ☆33Updated 4 months ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆41Updated 3 months ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated 2 years ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆183Updated last week
- Official Implementation of VideoDPO☆145Updated 4 months ago
- Chat about anything on any video!☆36Updated 2 years ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆25Updated last year
- [CVPR’25] PIVRG & ConsMTL☆16Updated 4 months ago
- ☆53Updated 2 months ago
- Sample LaTex file for HKU PhD thesis.☆26Updated 3 years ago
- ☆21Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated 2 years ago
- Documents used for grad school application☆303Updated 4 years ago
- A Pytorch Implementation of Finite Scalar Quantization☆161Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆132Updated 8 months ago
- ICCV2023-Diffusion-Papers☆108Updated 2 years ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆33Updated last year