Mixture-of-Groups Attention for End-to-End Long Video Generation
☆95Oct 22, 2025Updated 5 months ago
Alternatives and similar repositories for MoGA
Users that are interested in MoGA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Mar 21, 2025Updated last year
- 中科大跨模态智能组-每周论文分享☆16Nov 20, 2022Updated 3 years ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆157Mar 4, 2026Updated last month
- ☆14Apr 23, 2023Updated 2 years ago
- [ICLR 2026] Official repo for paper "Video-As-Prompt: Unified Semantic Control for Video Generation"☆409Feb 8, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.☆16Oct 20, 2025Updated 5 months ago
- Supplementary Material to accompany the paper, DJ Warne, SA Sisson, C Drovandi (2019) Acceleration of expensive computations in Bayesian…☆13Oct 23, 2020Updated 5 years ago
- Code for utilising VAE as means of doing exact MCMC inference in complex high-dimensional space☆14Jun 20, 2023Updated 2 years ago
- Official implementation of project NoiseCLR, published at CVPR 2024☆29Jun 15, 2024Updated last year
- ☆17May 13, 2025Updated 10 months ago
- [NeurIPS 24] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models☆44Sep 30, 2024Updated last year
- Code to reproduce experiments in Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows☆14May 23, 2024Updated last year
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆41Jan 5, 2026Updated 3 months ago
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Wan: Open and Advanced Large-Scale Video Generative Models☆28Jul 28, 2025Updated 8 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆28Feb 11, 2026Updated last month
- Probabilistic deep learning using JAX☆15Feb 8, 2025Updated last year
- Official repo for “Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy”☆14Nov 26, 2024Updated last year
- ☆16May 30, 2023Updated 2 years ago
- ☆10Nov 18, 2024Updated last year
- DINO-based perceptual losses and FDD feature extraction☆26Jan 7, 2026Updated 3 months ago
- [NeurIPS2024] Official Pytorch implementation of Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You …☆28Updated this week
- ☆23Dec 11, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Covo-Audio is a 7B-parameter end-to-end large audio language model that directly processes continuous audio inputs and generates audio ou…☆123Mar 17, 2026Updated 3 weeks ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆63Mar 6, 2026Updated last month
- [CVPR 2026] Official repository of Vision Test-Time Training☆68Mar 10, 2026Updated 3 weeks ago
- ☆72Mar 9, 2025Updated last year
- Gradient-informed particle MCMC methods☆12Jan 29, 2024Updated 2 years ago
- A summary of recent unsupervised semantic segmentation methods☆100May 8, 2023Updated 2 years ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated last year
- ☆38Dec 18, 2025Updated 3 months ago
- Perceiver (transformer variant) implemented in JAX and Flax☆13Mar 29, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"☆145Nov 13, 2025Updated 4 months ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆87Mar 9, 2026Updated last month
- My implement of InstantBooth☆13Sep 11, 2023Updated 2 years ago
- IEEE TNNLS, Collaborative Camouflaged Object Detection, CoCOD8K☆21Nov 24, 2023Updated 2 years ago
- [ICCV 2025] Identity Preserving 3D Head Stylization with Multiview Score Distillation☆16Jun 25, 2025Updated 9 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Make self forcing endless. Add cache purging. Add prompt controllability.☆70Sep 9, 2025Updated 7 months ago