anpwu / ZJU-CS-ClassNotes
☆21 · Updated 3 years ago
Alternatives and similar repositories for ZJU-CS-ClassNotes
Users who are interested in ZJU-CS-ClassNotes are comparing it to the repositories listed below.
- A collection of vision foundation models unifying understanding and generation. ☆57 · Updated 8 months ago
- This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆279 · Updated 3 weeks ago
- ☆258 · Updated last year
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning. ☆24 · Updated 3 months ago
- A paper list for spatial reasoning ☆134 · Updated 2 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆117 · Updated 10 months ago
- Provide .bst files for NeurIPS latex template ☆49 · Updated 4 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version). ☆16 · Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆144 · Updated 2 weeks ago
- The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment". ☆116 · Updated 11 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?". ☆35 · Updated 8 months ago
- Chat about anything on any video! ☆36 · Updated last year
- A tiny paper rating web ☆39 · Updated 5 months ago
- A preview-version of one novel multimodal reasoning benchmark CharmBench. ☆23 · Updated 2 weeks ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning ☆131 · Updated last year
- [ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆158 · Updated 3 months ago
- ☆48 · Updated last week
- [TMLR 2025🔥] A survey for the autoregressive models in vision. ☆686 · Updated last week
- ☆26 · Updated 5 months ago
- This mathematics course is taught to first-year Ph.D. students of computer science and related areas @zju ☆62 · Updated last year
- [ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio… ☆82 · Updated 6 months ago
- [NeurIPS 2024] DEMO: Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning ☆47 · Updated 10 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies ☆17 · Updated 2 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites. ☆161 · Updated 2 weeks ago
- A list of works on video generation towards world model ☆164 · Updated 3 weeks ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating… ☆121 · Updated 3 weeks ago
- Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision ☆101 · Updated 3 weeks ago
- Frequency Autoregressive Image Generation with Continuous Tokens ☆83 · Updated 2 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆374 · Updated 3 weeks ago
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention ☆39 · Updated 4 months ago