modelscope / Nexus-Gen
☆229Updated last month
Alternatives and similar repositories for Nexus-Gen
Users that are interested in Nexus-Gen are comparing it to the libraries listed below
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆629Updated 2 weeks ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆139Updated 3 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆192Updated 4 months ago
- A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆338Updated last week
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆278Updated 2 weeks ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆311Updated 11 months ago
- ☆111Updated 3 weeks ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆258Updated 3 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆222Updated 8 months ago
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…☆320Updated this week
- VideoGen-Eval: Agent-based System for Video Generation Evaluation☆241Updated 3 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆302Updated 5 months ago
- ImgEdit: A Unified Image Editing Dataset and Benchmark☆138Updated 2 weeks ago
- Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM.☆375Updated this week
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"☆186Updated 3 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆156Updated 2 weeks ago
- ☆250Updated 11 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆284Updated last week
- PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆385Updated 3 weeks ago
- Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".☆207Updated 3 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆79Updated last month
- ☆121Updated 3 weeks ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation☆221Updated 3 weeks ago
- [ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form u…☆205Updated 2 months ago
- ☆344Updated 3 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆109Updated 2 months ago
- Official code for ICCV 2025 paper, X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distilla…☆74Updated 2 weeks ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆126Updated 4 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆164Updated 3 weeks ago
- RepText: Rendering Visual Text via Replicating 🔥☆119Updated last month