anpwu / ZJU-CS-ClassNotes
☆21Updated 3 years ago
Alternatives and similar repositories for ZJU-CS-ClassNotes
Users who are interested in ZJU-CS-ClassNotes are comparing it to the repositories listed below.
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails.☆79Updated 3 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆143Updated last year
- A collection of vision foundation models unifying understanding and generation.☆59Updated 11 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆127Updated 2 months ago
- A tiny paper rating web☆38Updated 9 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.☆339Updated 2 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆37Updated last year
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆136Updated 2 months ago
- Provide .bst files for NeurIPS latex template☆49Updated 8 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆135Updated last month
- This is a collection of recent papers on reasoning in video generation models.☆83Updated this week
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆246Updated 8 months ago
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆207Updated 4 months ago
- ☆33Updated 9 months ago
- This mathematics course is taught for the first year Ph.D. students of computer science and related areas @zju☆60Updated last year
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆99Updated 5 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆222Updated last week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆193Updated 2 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆220Updated 2 months ago
- [ICML 2025] DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization☆20Updated 6 months ago
- A paper list for spatial reasoning☆521Updated last week
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆101Updated last week
- Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unifie…☆334Updated this week
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆35Updated 8 months ago
- Official codebase for the paper Latent Visual Reasoning☆60Updated last month
- A collection of awesome think with videos papers.☆73Updated 2 weeks ago
- Chat about anything on any video!☆36Updated 2 years ago
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs.☆68Updated 4 months ago
- ☆263Updated last year
- Incentivizing "Thinking with Long Videos" via Native Tool Calling☆142Updated this week