UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
☆46Aug 26, 2025Updated 7 months ago
Alternatives and similar repositories for UniFork
Users that are interested in UniFork are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆202Sep 18, 2025Updated 6 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆42Sep 30, 2025Updated 6 months ago
- ☆14Sep 22, 2025Updated 6 months ago
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 9 months ago
- [ICLR' 25] The PyTorch implementation of our paper: "Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Lea…☆21Feb 26, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models☆18Jun 6, 2024Updated last year
- Official Pytorch Implementation of Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model [ECCV'24]☆22Dec 24, 2024Updated last year
- Consistent Autoregressive Video Generation with Long Context☆81Feb 6, 2026Updated 2 months ago
- ☆56Nov 8, 2025Updated 5 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆137Jan 29, 2026Updated 2 months ago
- [CVPR 2025 Highlight] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation☆66May 9, 2025Updated 11 months ago
- pytorch implementation of "Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time"☆52Jan 27, 2026Updated 2 months ago
- [ICCV 25]SpectralAR: Spectral Autoregressive Visual Generation☆35Jun 13, 2025Updated 10 months ago
- [ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation☆57Dec 10, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆24Dec 23, 2024Updated last year
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Mar 24, 2026Updated 2 weeks ago
- ☆25Nov 25, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".☆29Dec 18, 2025Updated 3 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆220Jun 17, 2025Updated 9 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆141Dec 18, 2025Updated 3 months ago
- ☆132Jun 24, 2025Updated 9 months ago
- MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction☆142Apr 27, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Awesome Unified Multimodal Models☆1,181Mar 24, 2026Updated 2 weeks ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆24Nov 20, 2025Updated 4 months ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆238May 30, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆518Nov 14, 2025Updated 4 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels☆203Mar 11, 2026Updated last month
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆85Jul 23, 2024Updated last year
- ☆17Feb 20, 2025Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆142Mar 25, 2026Updated 2 weeks ago
- Official respository for ReasonGen-R1☆75Jun 23, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering☆120Mar 12, 2026Updated last month
- Code of MSCF-tracker v1.0 (Matlab Version for Discussion)☆26Jun 16, 2021Updated 4 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 8 months ago
- [NeurIPS 2025 Official Codes] Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards☆44Sep 23, 2025Updated 6 months ago
- Curated list of papers and resources focused on neural compression, intended to keep pace with the anticipated surge of research in the r…☆83Aug 1, 2024Updated last year
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated last year
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆450Dec 2, 2025Updated 4 months ago