hyungjin-chung / VPSLinks
☆15Updated 4 months ago
Alternatives and similar repositories for VPS
Users that are interested in VPS are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆154Updated last week
- Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward☆59Updated last month
- Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".☆146Updated 3 months ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆75Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Updated 10 months ago
- The official repository of "Spectral Motion Alignment for Video Motion Transfer using Diffusion Models".☆31Updated last year
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆112Updated 3 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆58Updated 2 weeks ago
- Evaluation codes and data for GenEval2☆51Updated last week
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆168Updated last month
- ☆47Updated 8 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Updated 9 months ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Updated last year
- [CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"☆28Updated 7 months ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Updated 2 years ago
- [ICCV-2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆52Updated 5 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Updated 5 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆42Updated 10 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆51Updated 8 months ago
- ☆34Updated 3 weeks ago
- [ICCV 2025] Official Implementation of Steering Rectified Flow Models in the Vector Field for Controlled Image Generation☆41Updated 6 months ago
- ☆20Updated 7 months ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆28Updated 5 months ago
- [ECCV 2024 Oral] ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction☆75Updated last year
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆94Updated 2 months ago
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Updated 3 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆61Updated 11 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Updated 11 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆185Updated 7 months ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆34Updated 7 months ago