[CVPR 2026] Official code for paper: SMRABooth: Subject and Motion Representation Alignment for Customized Video Generation
☆37Feb 21, 2026Updated 2 months ago
Alternatives and similar repositories for SMRABooth
Users that are interested in SMRABooth are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24May 9, 2024Updated last year
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 3 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 6 years ago
- ☆21Jun 3, 2023Updated 2 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CVPR 2025 Accepted Papers☆25Dec 20, 2025Updated 4 months ago
- pytorch implementation of XMC-GAN☆11Jun 2, 2021Updated 4 years ago
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 9 months ago
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models☆72Apr 23, 2026Updated last week
- Code for the paper Semantic-Guided Inpainting Network for Complex UrbanScenes Manipulation☆13Jul 7, 2021Updated 4 years ago
- ☆18Mar 21, 2025Updated last year
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆30Apr 9, 2026Updated 3 weeks ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆92Oct 15, 2025Updated 6 months ago
- [CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction☆41May 16, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆200Apr 13, 2026Updated 3 weeks ago
- 中科大跨模态智能组-每周论文分享☆15Nov 20, 2022Updated 3 years ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆45Apr 9, 2024Updated 2 years ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated last year
- UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation☆40Nov 24, 2025Updated 5 months ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆70Mar 27, 2026Updated last month
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- ☆30May 7, 2025Updated 11 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆29Apr 3, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode☆17Feb 16, 2025Updated last year
- PyTorch implementation of COCO-GAN (https://hubert0527.github.io/COCO-GAN/)☆25Nov 6, 2019Updated 6 years ago
- A PyTorch Implementation of Segmentation for Image Inpainting based on SPG-Net: https://arxiv.org/abs/1805.03356☆21Dec 19, 2019Updated 6 years ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆42Jan 29, 2026Updated 3 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- ☆39Jun 20, 2024Updated last year
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆88Mar 9, 2026Updated last month
- ☆24Nov 10, 2019Updated 6 years ago
- R-Precision evaluation for AttnGAN based model☆26Sep 13, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer☆155Mar 16, 2026Updated last month
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆72Feb 26, 2026Updated 2 months ago
- Animate Any Character in Any World☆97Mar 10, 2026Updated last month
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆24Nov 14, 2025Updated 5 months ago
- [NeurIPS 2024]Repos for "Visualization-of-Thought" dataset, construction code and evaluation.☆36Oct 23, 2024Updated last year
- [CVPR2020] Tensorflow implementation for paper ''Distribution-induced Bidirectional Generative Adversarial Network for Graph Representati…☆31Nov 24, 2021Updated 4 years ago
- Re-implementation of https://github.com/zsdonghao/text-to-image☆25Dec 28, 2017Updated 8 years ago