DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
☆18Feb 13, 2026Updated 2 weeks ago
Alternatives and similar repositories for DreamID-Omni
Users that are interested in DreamID-Omni are comparing it to the libraries listed below
- ☆41Feb 6, 2026Updated 3 weeks ago
- ☆24Oct 15, 2025Updated 4 months ago
- ☆40Dec 16, 2025Updated 2 months ago
- ☆81Feb 11, 2026Updated 2 weeks ago
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆51Jan 5, 2026Updated last month
- Video Diffusion Transformers are In-Context Learners☆35Jan 6, 2025Updated last year
- [ACL 2025] The official PyTorch implementation of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 9 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆154Sep 24, 2025Updated 5 months ago
- Mixture-of-Groups Attention for End-to-End Long Video Generation☆92Oct 22, 2025Updated 4 months ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 2 months ago
- ☆14Feb 13, 2026Updated 2 weeks ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆116Oct 7, 2025Updated 4 months ago
- Pytorch implementation of Self-Refining Video Sampling☆146Feb 6, 2026Updated 3 weeks ago
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)☆123Feb 6, 2026Updated 3 weeks ago
- ☆105Jan 6, 2026Updated last month
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆219Updated this week
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆32Nov 11, 2025Updated 3 months ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.☆29Sep 12, 2025Updated 5 months ago
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models☆115Jan 30, 2026Updated 3 weeks ago
- CVPR 2025 Accepted Papers☆23Dec 20, 2025Updated 2 months ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆22Feb 15, 2026Updated last week
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆38Oct 9, 2025Updated 4 months ago
- This is the official implementation of our paper: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame In…☆14Dec 5, 2025Updated 2 months ago
- Local 4B codebase explorer agent distilled from Qwen3-Coder-Next.☆70Updated this week
- Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control☆34Feb 4, 2026Updated 3 weeks ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- ☆22Nov 18, 2025Updated 3 months ago
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics☆121Jan 26, 2026Updated last month
- Use MCL within Mirai Console to manage packages and access other advanced features☆10Nov 13, 2022Updated 3 years ago
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Official project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).☆29Jan 31, 2026Updated 3 weeks ago
- ☆35Feb 12, 2026Updated 2 weeks ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 2 weeks ago
- An efficient distillation method for flow matching models☆22Feb 1, 2026Updated 3 weeks ago
- The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆14Mar 26, 2025Updated 11 months ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- A no-dependency utility to undervolt Intel CPUs on Linux systems, with user-friendly GUI☆16Apr 19, 2025Updated 10 months ago
- [CVPR25] IAR☆17Jun 13, 2025Updated 8 months ago