UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆47 · Aug 26, 2025 · Updated 8 months ago
Alternatives and similar repositories for UniFork
Users who are interested in UniFork are comparing it to the repositories listed below.
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆201 · Sep 18, 2025 · Updated 8 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos ☆52 · May 2, 2026 · Updated 3 weeks ago
- ☆14 · Sep 22, 2025 · Updated 8 months ago
- Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from Deepmind ☆64 · Updated this week
- Code implementation for: From Virtual Games to Real-World Play ☆47 · Jun 23, 2025 · Updated 10 months ago
- [ICLR'25] The PyTorch implementation of our paper: "Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Lea… ☆23 · Feb 26, 2025 · Updated last year
- Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models ☆18 · Jun 6, 2024 · Updated last year
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision ☆28 · May 26, 2025 · Updated 11 months ago
- CVPR 2025 Accepted Papers ☆25 · Dec 20, 2025 · Updated 5 months ago
- Official Pytorch Implementation of Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model [ECCV'24] ☆22 · Dec 24, 2024 · Updated last year
- Consistent Autoregressive Video Generation with Long Context ☆85 · Feb 6, 2026 · Updated 3 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels ☆224 · Apr 27, 2026 · Updated 3 weeks ago
- ☆59 · Nov 8, 2025 · Updated 6 months ago
- The official repo for "D-AR: Diffusion via Autoregressive Models" ☆138 · Jan 29, 2026 · Updated 3 months ago
- [CVPR 2025 Highlight] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation ☆66 · May 9, 2025 · Updated last year
- [ICCV 25] SpectralAR: Spectral Autoregressive Visual Generation ☆36 · Jun 13, 2025 · Updated 11 months ago
- [ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation ☆57 · Dec 10, 2025 · Updated 5 months ago
- ☆24 · Dec 23, 2024 · Updated last year
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆43 · Mar 24, 2026 · Updated last month
- ☆25 · Nov 25, 2024 · Updated last year
- A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models ☆113 · Updated this week
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆15 · Sep 7, 2024 · Updated last year
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM". ☆32 · Dec 18, 2025 · Updated 5 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions". ☆240 · Jun 17, 2025 · Updated 11 months ago
- [ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring … ☆20 · Jun 27, 2025 · Updated 10 months ago
- ☆132 · Jun 24, 2025 · Updated 10 months ago
- MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction ☆142 · Apr 27, 2024 · Updated 2 years ago
- [Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder… ☆152 · Dec 18, 2025 · Updated 5 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing ☆24 · Nov 20, 2025 · Updated 6 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models" ☆19 · Mar 21, 2024 · Updated 2 years ago
- Awesome Unified Multimodal Models ☆1,256 · Mar 24, 2026 · Updated last month
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ☆236 · May 30, 2025 · Updated 11 months ago
- PDR group data sharing plan ☆17 · Dec 1, 2022 · Updated 3 years ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆523 · Nov 14, 2025 · Updated 6 months ago
- Official repository of "Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection" [ICCV 2025] ☆21 · Jan 17, 2026 · Updated 4 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators" ☆85 · Jul 23, 2024 · Updated last year
- ☆17 · Feb 20, 2025 · Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) … ☆154 · May 11, 2026 · Updated last week
- ☆23 · Mar 15, 2024 · Updated 2 years ago