Mixture-of-Groups Attention for End-to-End Long Video Generation
☆97Oct 22, 2025Updated 6 months ago
Alternatives and similar repositories for MoGA
Users that are interested in MoGA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Mar 21, 2025Updated last year
- 中科大跨模态智能组-每周论文分享☆16Nov 20, 2022Updated 3 years ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆156Mar 4, 2026Updated last month
- [ICLR 2026] Generative View Stitching☆108Nov 7, 2025Updated 5 months ago
- Implementation of our IJCAI2022 oral paper, ER-SAN: Enhanced-Adaptive Relation Self-Attention Network for Image Captioning.☆24Aug 5, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Apr 23, 2023Updated 3 years ago
- [ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"☆417Feb 8, 2026Updated 2 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 6 months ago
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- Official implementation of project NoiseCLR, published at CVPR 2024☆29Jun 15, 2024Updated last year
- ☆17May 13, 2025Updated 11 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆44Sep 30, 2024Updated last year
- [NeurIPS25 Spotlight] Official Implementation for CBSA (Contract-and-Broadcast Self-Attention)☆36Apr 3, 2026Updated 3 weeks ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆41Jan 5, 2026Updated 3 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory☆154Feb 9, 2026Updated 2 months ago
- [CVPR 2026 Highlight] Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆667Nov 26, 2025Updated 5 months ago
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 11 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- Official repo for “Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy”☆14Nov 26, 2024Updated last year
- Wan: Open and Advanced Large-Scale Video Generative Models☆29Jul 28, 2025Updated 9 months ago
- ☆10Nov 18, 2024Updated last year
- DINO-based perceptual losses and FDD feature extraction☆27Jan 7, 2026Updated 3 months ago
- Official project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).☆34Mar 19, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Video Diffusion Transformers are In-Context Learners☆36Jan 6, 2025Updated last year
- ☆23Dec 11, 2024Updated last year
- Towards Sustainable Learning: Coresets for Data-efficient Deep Learning☆13Jul 5, 2023Updated 2 years ago
- ☆72Mar 9, 2025Updated last year
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆22Mar 29, 2025Updated last year
- ☆38Dec 18, 2025Updated 4 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆35Apr 21, 2026Updated last week
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"☆147Nov 13, 2025Updated 5 months ago
- IEEE TNNLS, Collaborative Camouflaged Object Detection, CoCOD8K☆21Nov 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation☆16Jun 25, 2025Updated 10 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆88Mar 9, 2026Updated last month
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Make self forcing endless. Add cache purging. Add prompt controllability.☆70Sep 9, 2025Updated 7 months ago
- ☆20Jun 4, 2025Updated 10 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆31Mar 26, 2025Updated last year