anpwu / ZJU-CS-ClassNotesLinks
☆21Updated 3 years ago
Alternatives and similar repositories for ZJU-CS-ClassNotes
Users that are interested in ZJU-CS-ClassNotes are comparing it to the libraries listed below
Sorting:
- A paper list for spatial reasoning☆127Updated last month
- VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆54Updated 2 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.☆23Updated 2 months ago
- A collection of vision foundation models unifying understanding and generation.☆57Updated 7 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆136Updated last month
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).☆16Updated last year
- ☆256Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆117Updated 9 months ago
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention☆39Updated 3 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆268Updated last week
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆113Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆366Updated this week
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆130Updated last year
- A tiny paper rating web☆39Updated 4 months ago
- Chat about anything on any video!☆36Updated last year
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju☆62Updated last year
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆151Updated 2 months ago
- ☆37Updated last week
- ☆194Updated this week
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆15Updated 2 months ago
- [NeurIPS 2024] DEMO: Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning☆47Updated 9 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆16Updated last month
- Official implementation of MC-LLaVA.☆130Updated 2 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆121Updated 3 months ago
- ☆26Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆186Updated 3 weeks ago
- Video Generation Benchmark☆42Updated 2 months ago
- The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment".☆115Updated 10 months ago
- 抢占显卡☆75Updated 9 months ago
- High-performance Image Tokenizers for VAR and AR☆279Updated 3 months ago