anpwu / ZJU-CS-ClassNotesLinks
☆21Updated 3 years ago
Alternatives and similar repositories for ZJU-CS-ClassNotes
Users that are interested in ZJU-CS-ClassNotes are comparing it to the libraries listed below
Sorting:
- A collection of vision foundation models unifying understanding and generation.☆57Updated 10 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).☆16Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆132Updated last year
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.☆24Updated 5 months ago
- ☆36Updated 4 months ago
- A paper list for spatial reasoning☆157Updated this week
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆324Updated 3 weeks ago
- ☆28Updated 7 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆92Updated 3 weeks ago
- [Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…☆48Updated last month
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆36Updated 11 months ago
- ☆259Updated last year
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆17Updated 5 months ago
- [ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio…☆94Updated 8 months ago
- Generative Universal Verifier as Multimodal Meta-Reasoner☆31Updated 2 weeks ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆58Updated 4 months ago
- [ACMMM 2025 - Dataset Track] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆19Updated 4 months ago
- [NeurIPS 2024] DEMO: Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning☆47Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆159Updated last month
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju☆62Updated last year
- A tiny paper rating web☆39Updated 7 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆232Updated 6 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆198Updated 3 weeks ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆36Updated last week
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆228Updated 5 months ago
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..☆79Updated last month
- Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unifie…☆300Updated 3 weeks ago
- Video Generation Benchmark☆60Updated 5 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆134Updated 3 weeks ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆173Updated 3 weeks ago