wl-zhao / THU-Courses
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users who are interested in THU-Courses are comparing it to the repositories listed below
- [ICML 2025] Code and data for the paper "Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation"☆102Updated 6 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, code, and related websites.☆91Updated this week
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆29Updated last week
- Documents used for grad school application☆302Updated 3 years ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆86Updated last year
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆31Updated 8 months ago
- ☆44Updated 3 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆67Updated 6 months ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 10 months ago
- Accepted by CVPR 2024☆33Updated last year
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆129Updated 2 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆86Updated last month
- ☆20Updated 5 months ago
- A PyTorch Implementation of Finite Scalar Quantization☆132Updated last year
- ☆21Updated last year
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆32Updated 6 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆11Updated 7 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆36Updated 3 weeks ago
- Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper☆105Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 4 months ago
- ☆9Updated 2 years ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- [ECCV 2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- Chat about anything on any video!☆36Updated last year
- Sample LaTeX file for HKU PhD thesis.☆23Updated 3 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆128Updated 9 months ago
- A list of works on video generation towards world model☆58Updated this week
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆140Updated last month
- ☆20Updated 2 years ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆83Updated last month