umm-emma / emmaLinks
Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."
☆61Updated last month
Alternatives and similar repositories for emma
Users that are interested in emma are comparing it to the libraries listed below
Sorting:
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆46Updated 9 months ago
- ☆107Updated 4 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆55Updated 8 months ago
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆125Updated last week
- ☆46Updated 2 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆68Updated 4 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆37Updated last month
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025)☆39Updated 6 months ago
- ☆31Updated 5 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆78Updated 10 months ago
- ☆52Updated 3 weeks ago
- An official implementation of SwapAnyone.☆73Updated 10 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 4 months ago
- Official implementation of "VideoMaMa: Mask-Guided Video Matting via Generative Prior"☆59Updated last week
- ☆32Updated 10 months ago
- Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance (arXiv 2025)☆53Updated 6 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆45Updated 2 months ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder☆178Updated 3 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆167Updated 3 weeks ago
- Code for CineScale, higher-resolution video generation based on Wan☆183Updated 5 months ago
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆51Updated last week
- VideoCoF: Unified Video Editing with Temporal Reasoner☆129Updated 3 weeks ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆107Updated last month
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69Updated 8 months ago
- Distilling Diversity and Control in Diffusion Models☆50Updated 9 months ago
- [arXiv 2025] Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers☆53Updated 5 months ago
- The official UniVerse-1 code.☆119Updated 3 months ago
- Animate Any Character in Any World☆89Updated 3 weeks ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆123Updated last year
- [NeurIPS 2024] Official Implementation of GrounDiT☆58Updated last year