wl-zhao / THU-Courses
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses:
Users who are interested in THU-Courses are comparing it to the repositories listed below
- Code and data for the paper "Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation"☆99Updated 6 months ago
- ☆16Updated last year
- Official code for MotionBench (CVPR 2025)☆34Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025)☆28Updated last month
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆30Updated 2 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated 2 years ago
- Curated list of recent visual autoregressive (VAR) modeling works☆30Updated last month
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆48Updated 2 weeks ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆77Updated last year
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆38Updated 11 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆85Updated last year
- Official PyTorch implementation of the paper "Equivariant Image Modeling" (https://arxiv.org/abs/2503.18948)☆33Updated last week
- Documents used for grad school application☆303Updated 3 years ago
- A comprehensive list of work on physical cognition in video generation, including papers, code, and related websites.☆71Updated this week
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆54Updated 3 weeks ago
- Accepted by CVPR 2024☆33Updated 11 months ago
- A PyTorch Implementation of Finite Scalar Quantization☆120Updated last year
- ☆19Updated 3 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆119Updated 3 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆31Updated 11 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models☆31Updated 5 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆66Updated 5 months ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆59Updated last year
- A Visualization Tool for GPU Occupancy on S Cluster.☆13Updated 2 years ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated last year
- Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆65Updated 2 months ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆80Updated 2 weeks ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 9 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆43Updated 10 months ago