wren93/tuna

☆94

Alternatives and similar repositories for tuna

Users that are interested in tuna are comparing it to the libraries listed below.

snowflakewang / CustomX
[ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models
☆96 · Jun 25, 2026 · Updated 3 weeks ago
facebookresearch / tuna-2
Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆738 · Updated this week
SOTAMak1r / VINO-code
A Unified Visual Generator with Interleaved OmniModal Context
☆232 · Mar 5, 2026 · Updated 4 months ago
ysy31415 / EffectMaker
Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
☆42 · Mar 6, 2026 · Updated 4 months ago
leeruibin / hybrid-forcing
☆32 · Apr 29, 2026 · Updated 2 months ago
MeiGen-AI / PosterReward
[CVPR2026] PosterReward: Unlocking Accurate Evaluation for High-Quality Graphic Design Generation
☆31 · Apr 2, 2026 · Updated 3 months ago
ByteVisionLab / NextFlow
NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation
☆331 · Jan 9, 2026 · Updated 6 months ago
JiazheWei / PosterCopilot
☆197 · Dec 10, 2025 · Updated 7 months ago
wz0919 / AnchorWeave
Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories
☆96 · Feb 17, 2026 · Updated 5 months ago
ZhaoJingjing713 / Spatia
[CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangle…
☆217 · May 12, 2026 · Updated 2 months ago
lizhiqi49 / MoCA
"MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation"
☆177 · Dec 9, 2025 · Updated 7 months ago
umm-emma / emma
Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."
☆62 · Dec 16, 2025 · Updated 7 months ago
KlingAIResearch / UniVideo
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
☆539 · Jul 3, 2026 · Updated 2 weeks ago
G-U-N / UniRL
[ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆91 · May 26, 2026 · Updated last month
JavisVerse / JavisGPT
[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆75 · Feb 26, 2026 · Updated 4 months ago
baaivision / Emu3.5
Native Multimodal Models are World Learners
☆1,536 · Dec 30, 2025 · Updated 6 months ago
Vchitect / LongVie
☆333 · Jan 24, 2026 · Updated 5 months ago
zhengdian1 / AIA
☆45 · Jan 4, 2026 · Updated 6 months ago
Songlin1998 / ShotVerse
☆103 · Mar 13, 2026 · Updated 4 months ago
MajorDavidZhang / Generalization_unified_VLM
☆24 · May 23, 2025 · Updated last year
DAGroup-PKU / SpatialT2I
[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆85 · Mar 2, 2026 · Updated 4 months ago
sk-adapter / SK-Adapter
[ECCV2026] Official repo for paper "SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation".
☆62 · Jun 26, 2026 · Updated 3 weeks ago
Tencent / HaploVLM
ICML2025
☆63 · Aug 28, 2025 · Updated 10 months ago
Osilly / Interleaving-Reasoning-Generation
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100 · Jan 26, 2026 · Updated 5 months ago
kszpxxzmc / ViSAudio
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
☆117 · Dec 11, 2025 · Updated 7 months ago
Vicky0522 / TokensGen
[ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation
☆57 · Dec 10, 2025 · Updated 7 months ago
jiaosiyuu / ThinkGen
ThinkGen: Generalized Thinking for Visual Generation
☆60 · Dec 30, 2025 · Updated 6 months ago
EnVision-Research / Lotus-2
Official implementation of "Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model"
☆271 · Apr 25, 2026 · Updated 2 months ago
AiEson / Part-X-MLLM
[ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
☆118 · Jun 17, 2026 · Updated last month
AMD-AGI / Nitro-E
Nitro-E is a family of text-to-image diffusion models focused on highly efficient training.
☆125 · Jun 4, 2026 · Updated last month
KlingAIResearch / SVG-T2I
[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder…
☆152 · Dec 18, 2025 · Updated 7 months ago
ZhuWenjie98 / DDE
(ECCV2026) Dual Distribution Estimation for Zero-shot Noisy Test-Time Adaptation with VLMs
☆15 · Jul 2, 2026 · Updated 2 weeks ago
AIFrontierLab / UniGame
[CVPR'26] UniGame code implementation
☆20 · Apr 21, 2026 · Updated 2 months ago
X-Omni-Team / X-Omni
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426 · Aug 26, 2025 · Updated 10 months ago
Liangsanzhu / Photo3D
Photo3D: Advancing Photorealistic 3D Generation through Structure‑Aligned Detail Enhancement
☆22 · Mar 18, 2026 · Updated 4 months ago
agwmon / self-refine-video
[ICML 2026] Pytorch implementation of Self-Refining Video Sampling
☆182 · May 1, 2026 · Updated 2 months ago
csslc / Self-Transcendence
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆36 · Jul 3, 2026 · Updated 2 weeks ago
Kr1sJFU / iMontage
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
☆188 · Dec 1, 2025 · Updated 7 months ago
bytedance-fanqie-ai / MoGA
Mixture-of-Groups Attention for End-to-End Long Video Generation
☆99 · Oct 22, 2025 · Updated 8 months ago