tliby/UniFork

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation

☆48

Alternatives and similar repositories for UniFork

Users who are interested in UniFork are comparing it to the libraries listed below.

csuhan / Tar
[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆202 • Sep 18, 2025 • Updated 10 months ago
inclusionAI / Ming-UniVision
Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer
☆143 • Oct 14, 2025 • Updated 9 months ago
ModalMinds / MM-PRM
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision
☆30 • May 26, 2025 • Updated last year
yuexy / ST-AR
☆14 • Sep 22, 2025 • Updated 10 months ago
ATH-MaaS / Ovis-U1
A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…
☆450 • Dec 2, 2025 • Updated 7 months ago
inclusionAI / TC-AE
Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"
☆24 • Apr 9, 2026 • Updated 3 months ago
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆204 • Jan 7, 2026 • Updated 6 months ago
OpenGVLab / DiffAgent
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆19 • Apr 16, 2024 • Updated 2 years ago
Aoko955 / Flash-VAED
[ICML 2026] Official codebase for "Flash-VAED: Plug-and-Play VAE Decoders for Efficient Video Generation"
☆27 • May 9, 2026 • Updated 2 months ago
FreedomIntelligence / ShareGPT-4o-Image
☆285 • Jul 22, 2025 • Updated last year
mira-wm / mira
Code for MIRA: Multiplayer Interactive World Models with Representation Autoencoders
☆446 • Jul 12, 2026 • Updated last week
KlingAIResearch / SVG-T2I
[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder…
☆152 • Dec 18, 2025 • Updated 7 months ago
FoundationVision / UniTok
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆529 • Nov 14, 2025 • Updated 8 months ago
wenqsun / Real-Play
Code implementation for: From Virtual Games to Real-World Play
☆48 • Jun 23, 2025 • Updated last year
techmonsterwang / iLLaMA
Adapting LLaMA Decoder to Vision Transformer
☆30 • May 20, 2024 • Updated 2 years ago
zhenliuZJU / Forge4D
Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos
☆53 • May 2, 2026 • Updated 2 months ago
wdrink / SimpleAR
PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆431 • Jun 20, 2025 • Updated last year
mlvlab / DeepVideoR1
[NeurIPS25] Official Implementation (PyTorch) of "DeepVideo-R1"
☆36 • Feb 22, 2026 • Updated 5 months ago
ByteVisionLab / NextFlow
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
☆331 • Jan 9, 2026 • Updated 6 months ago
wenqsun / Freeplane
Code for paper: Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models
☆18 • Jun 6, 2024 • Updated 2 years ago
SihuiJi / FashionComposer
☆24 • Dec 23, 2024 • Updated last year
SxJyJay / UniToken
[CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inpu…
☆106 • Apr 23, 2025 • Updated last year
OpenGVLab / Mono-InternVL
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆109 • Jul 18, 2025 • Updated last year
Franklin-Zhang0 / ReasonGen-R1
Official repository for ReasonGen-R1
☆75 • Jun 23, 2025 • Updated last year
showlab / FQGAN
FQGAN: Factorized Visual Tokenization and Generation
☆59 • Mar 29, 2025 • Updated last year
wyhlovecpp / GPT-Image-Edit
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
☆243 • Aug 15, 2025 • Updated 11 months ago
leeruibin / MfM
[ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
☆32 • Feb 5, 2026 • Updated 5 months ago
Harahan / MeanFlowNFT
[arXiv 2026] This is the official PyTorch implementation of "MeanFlowNFT: Bringing Forward-Process RL to Average-Velocity Generators".
☆59 • Updated this week
CodeGoat24 / Pref-GRPO
Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
☆274 • Feb 10, 2026 • Updated 5 months ago
Harahan / RTDMD
[arXiv 2026] This is the official PyTorch implementation of "RTDMD: Reinforcing Few-step Generators via Reward-Tilted Distribution Matchi…
☆41 • Jun 6, 2026 • Updated last month
ShivamDuggal4 / UNITE-tokenization-generation
Single-stage End-to-End Training for Tokenization and Generation
☆117 • Mar 24, 2026 • Updated 3 months ago
lucidrains / d4rt
Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from DeepMind
☆74 • Jun 20, 2026 • Updated last month
Tencent / HaploVLM
ICML 2025
☆63 • Aug 28, 2025 • Updated 10 months ago
JiuhaiChen / BLIP3o
Official implementation of BLIP3o-Series
☆1,663 • Nov 29, 2025 • Updated 7 months ago
bytedance / ContentV
☆130 • Jun 24, 2025 • Updated last year
alibaba-damo-academy / Lumos
[ICLR 2026] Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.
☆161 • Apr 6, 2026 • Updated 3 months ago
bimsarapathiraja / refedit
[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …
☆20 • Jun 27, 2025 • Updated last year
selftok-team / SelftokTokenizer
Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning
☆238 • May 30, 2025 • Updated last year
showlab / D-AR
The official repo for "D-AR: Diffusion via Autoregressive Models"
☆138 • Jan 29, 2026 • Updated 5 months ago