zhangguiwei610 / CAMEL
Official implementation for the CVPR 2024 paper CAMEL
☆18 Updated 10 months ago
Alternatives and similar repositories for CAMEL:
Users who are interested in CAMEL are comparing it to the libraries listed below:
- ☆22 Updated 3 weeks ago
- ☆39 Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality ☆31 Updated 4 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance ☆31 Updated 3 months ago
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing ☆15 Updated last month
- Video Diffusion State Space Models ☆19 Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation ☆17 Updated 7 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆69 Updated last week
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆24 Updated 5 months ago
- ☆27 Updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆52 Updated last month
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651) ☆77 Updated 9 months ago
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models" ☆21 Updated 3 months ago
- [AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation… ☆35 Updated 4 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes". ☆12 Updated last year
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning ☆30 Updated last year
- Unified layout planning and image generation ☆14 Updated last week
- ☆18 Updated last month
- ☆15 Updated last year
- ☆19 Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax ☆18 Updated last year
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆49 Updated 11 months ago
- 🎨 Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation ☆53 Updated last week
- ☆20 Updated 7 months ago
- ☆33 Updated 6 months ago
- An Empirical Study of GPT-4o Image Generation Capabilities ☆11 Updated this week
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆28 Updated last month
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆29 Updated 4 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" ☆45 Updated 4 months ago
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆17 Updated last month