[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆151Jan 26, 2026Updated 5 months ago
Alternatives and similar repositories for UniLIP
Users that are interested in UniLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Visual Generation Tuning☆101Apr 16, 2026Updated 2 months ago
- ☆188Jun 27, 2025Updated last year
- ☆31Jul 16, 2025Updated 11 months ago
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆37Apr 2, 2026Updated 2 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆48Jul 22, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Source code for the Information Sciences paper "Rumor Detection on Social Media through Mining the Social Circles with High Homogeneity"☆21Jun 10, 2023Updated 3 years ago
- Unofficial Implementation of Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis☆16Sep 27, 2023Updated 2 years ago
- ☆72Nov 24, 2025Updated 7 months ago
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆53Jun 1, 2026Updated 3 weeks ago
- Controlnet module for Wan2.1☆32Aug 4, 2025Updated 10 months ago
- [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…☆406May 23, 2026Updated last month
- [ICCV 2025] The official pytorch implement of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆23Oct 28, 2025Updated 8 months ago
- Reddit Crawler API for collecting datasets from Reddit.☆11Dec 31, 2022Updated 3 years ago
- YOLOv8安全帽工作服检测☆12Oct 13, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'26] UniGame code implementation☆20Apr 21, 2026Updated 2 months ago
- GEditBench v2: A Human-Aligned Benchmark for General Image Editing☆57Jun 18, 2026Updated last week
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆191May 21, 2025Updated last year
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆13Dec 21, 2025Updated 6 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆465Aug 8, 2025Updated 10 months ago
- Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation☆25May 14, 2026Updated last month
- [IJCAI'24] Official code for our paper "Make Graph Neural Networks Great Again: A Generic Integration Paradigm of Topology-Free Patterns …☆15Jul 3, 2025Updated 11 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- Official implementation for "Revisiting Discriminative vs. Generative Classifiers: Theory and Implications".☆14Feb 7, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning☆235Jan 22, 2026Updated 5 months ago
- Offical implemention of Robust Superpixel-Guided Attentional Adversarial Attack (CVPR2020)☆10Jan 5, 2022Updated 4 years ago
- ☆16May 18, 2026Updated last month
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations☆202Sep 18, 2025Updated 9 months ago
- [ICLR 2026] Any-step Generation via N-th Order Recursive Consistent Velocity Field Estimation☆36Feb 4, 2026Updated 4 months ago
- Hyperspectral Imagery One Class Classification (ISPRS 2022 & TGRS 2023)☆13Jan 28, 2026Updated 5 months ago
- python实现微博热点事件舆情分析(爬虫)☆12May 5, 2022Updated 4 years ago
- Icon Matching with Shape Context Descriptors.☆10Jun 20, 2026Updated last week
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆877Dec 23, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2024] Boosting Adversarial Transferability by Block Shuffle and Rotation☆14Feb 28, 2024Updated 2 years ago
- ☆30Jun 19, 2025Updated last year
- ☆15Mar 21, 2025Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jun 2, 2026Updated 3 weeks ago
- 🚀 原生使用 Deepspeed 训练 Diffusers | Native Training of Diffusers with Deepspeed☆19Jan 19, 2025Updated last year
- A non-official re-implementation of article "[ECCV 18] Image Inpainting for Irregular Holes Using Partial Convolutions"☆12Mar 1, 2025Updated last year
- ☆15Dec 9, 2024Updated last year