[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆150Jan 26, 2026Updated 3 months ago
Alternatives and similar repositories for UniLIP
Users that are interested in UniLIP are comparing it to the libraries listed below.
- Visual Generation Tuning☆100Apr 16, 2026Updated last month
- ☆187Jun 27, 2025Updated 10 months ago
- ☆31Jul 16, 2025Updated 10 months ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆37Apr 2, 2026Updated last month
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆47Jul 22, 2025Updated 9 months ago
- ☆10Oct 21, 2024Updated last year
- Unofficial Implementation of Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis☆16Sep 27, 2023Updated 2 years ago
- ☆72Nov 24, 2025Updated 5 months ago
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆51Feb 16, 2026Updated 3 months ago
- Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"☆23Dec 22, 2025Updated 4 months ago
- Controlnet module for Wan2.1☆31Aug 4, 2025Updated 9 months ago
- [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…☆394Updated this week
- Reddit Crawler API for collecting datasets from Reddit.☆11Dec 31, 2022Updated 3 years ago
- YOLOv8 safety helmet and workwear detection☆13Oct 13, 2023Updated 2 years ago
- GEditBench v2: A Human-Aligned Benchmark for General Image Editing☆53Apr 1, 2026Updated last month
- Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation☆24Updated this week
- [CVPR'26] UniGame code implementation☆19Apr 21, 2026Updated 3 weeks ago
- H2ASeg: Hierarchical Adaptive Interaction and Weighting Network for Tumor Segmentation in PET/CT Images☆20May 29, 2025Updated 11 months ago
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆28Nov 18, 2025Updated 6 months ago
- ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling☆88Apr 22, 2026Updated 3 weeks ago
- RS Generate dataset☆18Jan 2, 2025Updated last year
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆189May 21, 2025Updated 11 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆460Aug 8, 2025Updated 9 months ago
- [IJCAI'24] Official code for our paper "Make Graph Neural Networks Great Again: A Generic Integration Paradigm of Topology-Free Patterns …☆15Jul 3, 2025Updated 10 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆24Mar 8, 2026Updated 2 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆21Jan 27, 2025Updated last year
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆236Jan 22, 2026Updated 3 months ago
- Official implementation of Robust Superpixel-Guided Attentional Adversarial Attack (CVPR2020)☆10Jan 5, 2022Updated 4 years ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆201Sep 18, 2025Updated 8 months ago
- Python implementation of public opinion analysis for trending Weibo events (web crawler)☆12May 5, 2022Updated 4 years ago
- [ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation☆36Feb 4, 2026Updated 3 months ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆874Dec 23, 2025Updated 4 months ago
- ☆30Jun 19, 2025Updated 11 months ago
- [ICCV2025] Training-Free Diffusion Models for Geometric Image Editing☆33Jan 13, 2026Updated 4 months ago
- ☆22Sep 23, 2025Updated 7 months ago
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆13Jan 22, 2025Updated last year
- ☆15Dec 9, 2024Updated last year
- [AAAI 2026] This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"☆141Apr 24, 2026Updated 3 weeks ago