wl-zhao / THU-CoursesLinks
☆17Updated 3 years ago
Alternatives and similar repositories for THU-Courses
Users that are interested in THU-Courses are comparing it to the libraries listed below
Sorting:
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆143Updated last year
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆34Updated last year
- ☆37Updated 2 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆94Updated last year
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆144Updated 6 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆277Updated 2 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆87Updated last year
- A framework that allows you to apply Sparse AutoEncoder on any models☆47Updated 5 months ago
- A survey for visual generation alignment☆100Updated last month
- a reading list for human-centered AI☆44Updated 3 years ago
- Sample LaTex file for HKU PhD thesis.☆26Updated 3 years ago
- ICCV2023-Diffusion-Papers☆108Updated 2 years ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆138Updated 10 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- Source code for my homepage.☆14Updated 3 weeks ago
- Official Implementation of "Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling"☆16Updated 2 years ago
- Denoising Diffusion Step-aware Models (ICLR2024)☆61Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated 2 years ago
- Documents used for grad school application☆309Updated 4 years ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆136Updated last year
- PAI-Bench: A Comprehensive Benchmark for Physical AI☆38Updated 2 weeks ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆187Updated 10 months ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated 2 years ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆164Updated last month
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆30Updated last year
- ☆67Updated 4 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆136Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆130Updated 11 months ago
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Updated 2 years ago
- [WIP🚧] 2025 up-to-date list of resources on visual tokenizers (primarily for visual generation). Give it a star 🌟 if you find it useful…☆20Updated 11 months ago