UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆46Aug 26, 2025Updated 6 months ago
Alternatives and similar repositories for UniFork
Users that are interested in UniFork are comparing it to the libraries listed below.
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆201Sep 18, 2025Updated 6 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆40Sep 30, 2025Updated 5 months ago
- ☆14Sep 22, 2025Updated 6 months ago
- [ICLR'25] The PyTorch implementation of our paper: "Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Lea…☆21Feb 26, 2025Updated last year
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 9 months ago
- Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models☆18Jun 6, 2024Updated last year
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆28May 26, 2025Updated 9 months ago
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 3 months ago
- Official PyTorch Implementation of Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model [ECCV'24]☆22Dec 24, 2024Updated last year
- ☆54Nov 8, 2025Updated 4 months ago
- Consistent Autoregressive Video Generation with Long Context☆75Feb 6, 2026Updated last month
- The official repo for "D-AR: Diffusion via Autoregressive Models"☆135Jan 29, 2026Updated last month
- PyTorch implementation of "Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time"☆48Jan 27, 2026Updated last month
- [CVPR 2025 Highlight] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation☆64May 9, 2025Updated 10 months ago
- [ICCV 25] SpectralAR: Spectral Autoregressive Visual Generation☆35Jun 13, 2025Updated 9 months ago
- ☆24Dec 23, 2024Updated last year
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Nov 20, 2025Updated 4 months ago
- Collection of peptide de novo sequencing algorithms by BEAM labs☆30Dec 6, 2025Updated 3 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆212Jun 17, 2025Updated 9 months ago
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".☆28Dec 18, 2025Updated 3 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆138Dec 18, 2025Updated 3 months ago
- [ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …☆19Jun 27, 2025Updated 8 months ago
- MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction☆142Apr 27, 2024Updated last year
- ☆132Jun 24, 2025Updated 8 months ago
- Awesome Unified Multimodal Models☆1,152Feb 6, 2026Updated last month
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆24Nov 20, 2025Updated 4 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated 2 years ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆238May 30, 2025Updated 9 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆518Nov 14, 2025Updated 4 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆85Jul 23, 2024Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆138Oct 22, 2025Updated 5 months ago
- ☆23Mar 15, 2024Updated 2 years ago
- ☆17Feb 20, 2025Updated last year
- [ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering☆111Mar 12, 2026Updated last week
- Official repository for ReasonGen-R1☆75Jun 23, 2025Updated 9 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆204Jun 10, 2025Updated 9 months ago
- Code of MSCF-tracker v1.0 (Matlab Version for Discussion)☆26Jun 16, 2021Updated 4 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21Jul 26, 2025Updated 7 months ago