Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."
☆62Dec 16, 2025Updated 3 months ago
Alternatives and similar repositories for emma
Users that are interested in emma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Dec 12, 2025Updated 3 months ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆62Jul 1, 2025Updated 8 months ago
- ☆33Aug 9, 2024Updated last year
- OmniGAIA: Towards Native Omni-Modal AI Agents☆82Mar 16, 2026Updated last week
- ☆22Feb 13, 2026Updated last month
- ☆14Sep 22, 2025Updated 6 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 7 months ago
- BotCorner 2.0☆12Jul 12, 2023Updated 2 years ago
- Advanced FLUX LoRA manipulation toolkit with GUI interface☆58Nov 5, 2025Updated 4 months ago
- ☆12Jun 1, 2023Updated 2 years ago
- Distance Guided Channel Weighting for Semantic Sgementation (https://arxiv.org/abs/2004.12679)☆14Nov 24, 2020Updated 5 years ago
- [CVPR 2026] Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration☆40Feb 25, 2026Updated 3 weeks ago
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆74Feb 27, 2026Updated 3 weeks ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆121Dec 17, 2025Updated 3 months ago
- ☆11Sep 4, 2022Updated 3 years ago
- Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion☆101Mar 12, 2026Updated last week
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆41Jan 29, 2026Updated last month
- A JavaFX desktop application for extracting and managing AI image generation metadata across multiple platforms. Features recursive parsi…☆38Jan 11, 2026Updated 2 months ago
- Nodes to level up your workflows performance and streamline specific functions.☆10Aug 19, 2025Updated 7 months ago
- 🔊Replicate Cog'ified MMAudio🎵☆18Jul 10, 2025Updated 8 months ago
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆21Jan 8, 2024Updated 2 years ago
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆36Feb 6, 2026Updated last month
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆39Dec 30, 2025Updated 2 months ago
- ☆114Apr 25, 2025Updated 10 months ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- MediaWikiBot,通过聊天软件对MediaWiki进行信息查询的机器人,支持QQ,Telegram,Line,KaiHeiLa☆11Jun 17, 2025Updated 9 months ago
- Pytorch Lightning Template for Sematic Segmentation☆11Jan 17, 2023Updated 3 years ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆73May 24, 2024Updated last year
- An API with an intuitive visual interface that enables the unofficial integration of DeepSeek into SillyTavern.☆23Mar 11, 2026Updated last week
- ☆18May 15, 2025Updated 10 months ago
- ☆45Nov 26, 2025Updated 3 months ago
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation☆37Aug 1, 2025Updated 7 months ago
- Nodes for image juxtaposition for Flux in ComfyUI☆12Apr 22, 2025Updated 11 months ago
- Instaswap Desktop App☆19Dec 11, 2024Updated last year
- ComfyUI custom nodes for AudioX — generate sound effects and background music from video, powered by HKUSTAudio/AudioX.☆31Mar 12, 2026Updated last week
- FIBO-Edit brings the power of structured prompt generation to image editing☆32Jan 29, 2026Updated last month
- Minimalistically refactored reference Gaussian splatting library☆16Feb 24, 2025Updated last year
- ☆15Sep 4, 2024Updated last year
- ☆76Mar 16, 2026Updated last week