tliby / UniFork
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆46 · Updated 5 months ago
Alternatives and similar repositories for UniFork
Users who are interested in UniFork are comparing it to the repositories listed below.
- [ICLR 2026] Uniform Discrete Diffusion with Metric Path for Video Generation ☆98 · Updated 3 weeks ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation ☆116 · Updated 4 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking" ☆51 · Updated last week
- [ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation ☆56 · Updated 2 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆86 · Updated 11 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation ☆233 · Updated 3 months ago
- ☆35 · Updated last month
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders ☆202 · Updated last week
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models ☆157 · Updated last month
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis ☆62 · Updated 9 months ago
- Consistent Autoregressive Video Generation with Long Context ☆31 · Updated this week
- Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Gene… ☆298 · Updated this week
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image ☆55 · Updated last year
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆96 · Updated 8 months ago
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆163 · Updated 3 months ago
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO ☆92 · Updated 2 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆172 · Updated last month
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text ☆53 · Updated 10 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆198 · Updated 4 months ago
- ☆47 · Updated 9 months ago
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration" ☆255 · Updated last month
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography ☆100 · Updated last month
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers ☆77 · Updated 6 months ago
- ☆100 · Updated 2 weeks ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ☆50 · Updated 6 months ago
- ☆52 · Updated last year
- Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure? ☆216 · Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025) ☆53 · Updated 9 months ago
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time ☆319 · Updated 3 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten… ☆64 · Updated 7 months ago