Jodi: Unification of Visual Generation and Understanding via Joint Modeling
☆90Jun 19, 2025Updated 8 months ago
Alternatives and similar repositories for Jodi
Users that are interested in Jodi are comparing it to the libraries listed below
Sorting:
- A better shell☆14Feb 5, 2026Updated 3 weeks ago
- Instance-level Facial Attributes Editing (CVIU 2021)☆15Jul 19, 2022Updated 3 years ago
- ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"☆17Jun 26, 2025Updated 8 months ago
- ☆17Dec 8, 2020Updated 5 years ago
- Cluster Document for IIL@HIT☆20Apr 5, 2023Updated 2 years ago
- Implement Diffusion Models with PyTorch.☆23Nov 24, 2024Updated last year
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated 10 months ago
- Distilling Diversity and Control in Diffusion Models☆50Apr 28, 2025Updated 9 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated 8 months ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 2 years ago
- Official implementation of "Describing Sets of Images with Textual-PCA".☆16Feb 13, 2023Updated 3 years ago
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated 8 months ago
- [ICLR 2025] Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"☆262Jan 12, 2026Updated last month
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆30Jul 5, 2023Updated 2 years ago
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆26Dec 12, 2024Updated last year
- PreLAR: World Model Pre-training with Learnable Action Representation, ECCV 2024☆169Apr 15, 2025Updated 10 months ago
- Simple Tensorflow implementation of "Adaptive Convolutions for Structure-Aware Style Transfer" (CVPR 2021)☆25Aug 24, 2022Updated 3 years ago
- A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration☆17Jul 22, 2022Updated 3 years ago
- Breaking Boundary Between Pre-training and Fine-tuning with Hybrid Prompting for Knowledge-Based VQA☆140Mar 10, 2024Updated last year
- ☆17Jan 17, 2025Updated last year
- Whitening and Coloring transform for GANs☆35May 15, 2019Updated 6 years ago
- ☆48Feb 9, 2026Updated 2 weeks ago
- This repository is the official implementation of the paper "Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation D…☆143Oct 17, 2023Updated 2 years ago
- Code Implementation of “RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers”☆30Dec 27, 2025Updated 2 months ago
- [IEEE TMM 2024] NIR-Assisted Image Denoising: A Selective Fusion Approach and A Real-World Benchmark Dataset☆21Feb 23, 2025Updated last year
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 4 years ago
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆85Sep 18, 2025Updated 5 months ago
- ☆22Nov 4, 2024Updated last year
- ☆19Dec 18, 2024Updated last year
- Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆165Oct 21, 2025Updated 4 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆46Mar 5, 2024Updated last year
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆20May 2, 2025Updated 9 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated last month
- Controllable Hair Editing (ECCV 2022)☆83May 11, 2023Updated 2 years ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆270Apr 15, 2025Updated 10 months ago
- ☆44Nov 14, 2019Updated 6 years ago
- [AAAI2026] Implementation Code for Omni-Effects☆173Dec 9, 2025Updated 2 months ago