[ICLR 2026 🔥 ] Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"
☆134Jan 26, 2026Updated last month
Alternatives and similar repositories for UniLIP
Users that are interested in UniLIP are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models☆33Dec 27, 2025Updated 2 months ago
- ☆71Nov 24, 2025Updated 3 months ago
- ☆183Jun 27, 2025Updated 8 months ago
- 🚀 原生使用 Deepspeed 训练 Diffusers | Native Training of Diffusers with Deepspeed☆19Jan 19, 2025Updated last year
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Jul 22, 2025Updated 7 months ago
- Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset☆105Feb 25, 2026Updated last week
- ☆31Jul 16, 2025Updated 7 months ago
- Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"☆129Oct 9, 2025Updated 5 months ago
- Visual Generation Tuning☆99Jan 27, 2026Updated last month
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆49Feb 16, 2026Updated 3 weeks ago
- YOLOv8安全帽工作服检测☆12Oct 13, 2023Updated 2 years ago
- [ECCV 2024] Official repository of "GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning".☆29Dec 18, 2024Updated last year
- [NeurIPS 2025 Spotlight] FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities☆69Dec 21, 2025Updated 2 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆63Dec 11, 2025Updated 2 months ago
- [CoRL 2023] DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking☆69Jan 21, 2024Updated 2 years ago
- [ICLR 2026] Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potenti…☆371Feb 5, 2026Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆443Aug 8, 2025Updated 7 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆187May 21, 2025Updated 9 months ago
- Structured Video Comprehension of Real-World Shorts☆232Sep 21, 2025Updated 5 months ago
- Implementation of followinf estimation algorithms in python: Kalman Filter, Extended Kalman Filter, Unscented Kalman Filter, Cubature Kal…☆11Dec 2, 2023Updated 2 years ago
- official code for unigame☆19Nov 26, 2025Updated 3 months ago
- Python code to interface a Raspberry Pi 4 with a Bluetooth OBD II Adapter to retrieve data, displaying those values onto a GUI using pyga…☆12Dec 26, 2023Updated 2 years ago
- ☆31Jun 19, 2025Updated 8 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆176Sep 1, 2025Updated 6 months ago
- this start a http server for flutter web, it can also proxy api requests for Cross-Origin Request.为flutter web启动一个http服务器,并且可以代理api请求,解决跨…☆15Dec 15, 2019Updated 6 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- the tron wallet of flash☆13Feb 18, 2021Updated 5 years ago
- ☆16Sep 17, 2024Updated last year
- ☆41Oct 19, 2025Updated 4 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- An easy to configure, modular build tool running on esbuild with a powerful plugin api.☆12Dec 6, 2021Updated 4 years ago
- Sharing of geographical information made really, really easy!☆23Mar 12, 2013Updated 12 years ago
- A SWC plugin for remove matched charset☆10Dec 26, 2024Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆19Jan 6, 2026Updated 2 months ago
- FOR FRESHMAN IN NUDT CORE LAB RESEARCH AND CODING TOOLS☆10Jun 19, 2022Updated 3 years ago
- ☆11Oct 30, 2024Updated last year
- SimX-OR: Extending Any Simulation Benchmark to Evaluate the Observational Robustness of VLA Models☆31Nov 4, 2025Updated 4 months ago
- Reimplementation of D4RT☆34Dec 26, 2025Updated 2 months ago
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago