wl-zhao / THU-Courses
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses:
Users that are interested in THU-Courses are comparing it to the libraries listed below
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆95Updated 5 months ago
- ☆16Updated 11 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆26Updated last month
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated 2 years ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆30Updated 7 months ago
- Documents used for grad school application☆302Updated 3 years ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025)☆24Updated last week
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 8 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆119Updated 2 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆77Updated 11 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated last month
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆109Updated 10 months ago
- Accepted by CVPR 2024☆32Updated 10 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆84Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆11Updated 5 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆54Updated last week
- ☆43Updated last month
- ☆50Updated last week
- [CVPR 2025] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆42Updated 3 weeks ago
- [CVPR 2025] Open implementation of "RandAR"☆69Updated last week
- ☆106Updated last year
- ICLR2023 statistics☆60Updated last year
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆28Updated 7 months ago
- Official code for MotionBench (CVPR 2025)☆31Updated 3 weeks ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆86Updated 2 months ago
- Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation (ICCV 2023)☆23Updated last year
- ☆26Updated this week
- ElasticTok: Adaptive Tokenization for Image and Video☆64Updated 4 months ago
- ☆89Updated 3 months ago
- ☆21Updated 10 months ago