showlab/ROICtrl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/showlab/ROICtrl)

showlab / ROICtrl

Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation

☆110

Alternatives and similar repositories for ROICtrl

Users that are interested in ROICtrl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

showlab / FAR
View on GitHub
Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
☆311Apr 23, 2025Updated last year
showlab / FQGAN
View on GitHub
FQGAN: Factorized Visual Tokenization and Generation
☆59Mar 29, 2025Updated last year
TencentARC / Mix-of-Show
View on GitHub
NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models
☆427May 14, 2024Updated 2 years ago
showlab / EvolveDirector
View on GitHub
[NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.
☆52Oct 14, 2024Updated last year
showlab / VideoSwap
View on GitHub
Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
☆405Dec 6, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
showlab / Exo2Ego-V
View on GitHub
☆61Apr 28, 2025Updated last year
showlab / AUI
View on GitHub
Computer-Use Agents as Judges for Generative UI
☆44Nov 27, 2025Updated 7 months ago
showlab / VideoLISA
View on GitHub
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆148Dec 26, 2024Updated last year
showlab / Show-Anything-3D
View on GitHub
Edit and Generate Anything in 3D world!
☆13Apr 15, 2023Updated 3 years ago
showlab / cosmo
View on GitHub
☆75May 10, 2024Updated 2 years ago
showlab / SMS
View on GitHub
[ICCV 2025] Balanced Image Stylization with Style Matching Score
☆69Mar 9, 2026Updated 4 months ago
video-reality-test / video-reality-test
View on GitHub
☆23May 5, 2026Updated 2 months ago
showlab / Efficient-CLS
View on GitHub
[ICCV 2023] Label-Efficient Online Continual Object Detection in Streaming Video
☆23Jan 8, 2024Updated 2 years ago
NVlabs / AnyFlow
View on GitHub
Flow Map OPD for AnyStep Video Diffusion
☆398May 23, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
guyuchao / iNAS
View on GitHub
Open Source Neural Architecture Search Toolbox for Device-aware Image Dense Prediction & Official implementation of ICCV2021 "iNAS: Integ…
☆84Apr 11, 2022Updated 4 years ago
showlab / Impossible-Videos
View on GitHub
ICML 2025 - Impossible Videos
☆81Jul 23, 2025Updated last year
showlab / MovieBench
View on GitHub
[CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation
☆97Mar 16, 2025Updated last year
showlab / DoraCycle
View on GitHub
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
☆31Mar 8, 2026Updated 4 months ago
TencentARC / HOSNeRF
View on GitHub
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
☆69Dec 12, 2023Updated 2 years ago
CSU-JPG / TextAtlas
View on GitHub
[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
☆93Sep 27, 2025Updated 9 months ago
MCG-NKU / SERE
View on GitHub
Exploring Feature Self-relation for Self-supervised Transformer (TPAMI 2023)
☆21Apr 30, 2025Updated last year
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
showlab / WorldGUI
View on GitHub
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.
☆124Jul 27, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
showlab / Edit2Perceive
View on GitHub
[CVPR 2026] Official Implementation of Edit2Perceive
☆47Feb 21, 2026Updated 5 months ago
lzyhha / HSSL
View on GitHub
Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)
☆15May 2, 2025Updated last year
showlab / Adv-GRPO
View on GitHub
[CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…
☆88Feb 26, 2026Updated 4 months ago
lzyhha / VisualCloze
View on GitHub
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…
☆283Jan 7, 2026Updated 6 months ago
ZX-Yin / DreamLifting
View on GitHub
The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".
☆30Sep 1, 2025Updated 10 months ago
showlab / T2VScore
View on GitHub
T2VScore: Towards A Better Metric for Text-to-Video Generation
☆81Apr 10, 2024Updated 2 years ago
exped1230 / S2-VER
View on GitHub
The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition
☆11Apr 28, 2024Updated 2 years ago
showlab / CLVQA
View on GitHub
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆42Mar 23, 2024Updated 2 years ago
zhaohengyuan1 / Genixer
View on GitHub
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
☆116Mar 21, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
EchoPluto / MagicID
View on GitHub
☆35Mar 18, 2025Updated last year
CUC-MIPG / Edit-Transfer
View on GitHub
Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"
☆89Jun 6, 2025Updated last year
showlab / MovieSeq
View on GitHub
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46Mar 11, 2025Updated last year
CUC-MIPG / UniVid
View on GitHub
Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026
☆37Nov 24, 2025Updated 8 months ago
weijiawu / ParaDiffusion
View on GitHub
[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆107Mar 24, 2025Updated last year
showlab / TPDiff
View on GitHub
TPDiff: Temporal Pyramid Video Diffusion Model
☆25Mar 13, 2025Updated last year
showlab / ShowAnything
View on GitHub
☆83Aug 1, 2023Updated 2 years ago