wl-zhao / THU-Courses
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users who are interested in THU-Courses are comparing it to the repositories listed below
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆117Updated 9 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆198Updated 3 weeks ago
- ElasticTok: Adaptive Tokenization for Image and Video☆74Updated 9 months ago
- DDN: A novel generative model with simple principles and unique properties. (ICLR 2025)☆25Updated this week
- Sample LaTeX file for HKU PhD thesis.☆26Updated 3 years ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆138Updated 2 months ago
- [CVPR'25] A vision question answering (VQA) benchmark for 6D spatial reasoning.☆10Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆39Updated 3 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆153Updated this week
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?☆126Updated 5 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆90Updated last year
- A survey for visual generation alignment☆30Updated this week
- ☆258Updated last year
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆32Updated 11 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆129Updated 3 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆186Updated 3 weeks ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆61Updated last year
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆15Updated 4 months ago
- TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆24Updated this week
- LaTeX Poster for SDPS-Net (CVPR 2019)☆33Updated 6 years ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆36Updated last month
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆34Updated last year
- Documents used for grad school application☆303Updated 4 years ago
- Source code for my homepage.☆12Updated this week
- A list of works on video generation towards world model☆162Updated this week
- ☆34Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆129Updated 6 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆39Updated last year
- ☆29Updated 2 months ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆201Updated 2 months ago