zhangguiwei610 / CAMEL
Official implementation for the CVPR 2024 paper CAMEL
☆18Updated 10 months ago
Alternatives and similar repositories for CAMEL
Users that are interested in CAMEL are comparing it to the libraries listed below
Sorting:
- ☆23Updated last month
- ☆39Updated last year
- ReNeg: Learning Negative Embedding with Reward Guidance☆31Updated 4 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 5 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated last year
- Video Diffusion State Space Models☆19Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆25Updated 6 months ago
- ☆26Updated 2 months ago
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models"☆22Updated 4 months ago
- ☆19Updated 2 years ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆40Updated last month
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- ☆33Updated 6 months ago
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆77Updated 10 months ago
- [AAAI 2025] Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation…☆35Updated 4 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆19Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 9 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 6 months ago
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated last year
- ☆52Updated 2 years ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated last year
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆57Updated 2 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆82Updated last month
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆15Updated last month
- An innovative method designed to augment the capabilities of existing video diffusion models☆22Updated last year
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆28Updated this week
- ☆43Updated 4 months ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆84Updated last year
- ☆11Updated last year