zhangguiwei610 / CAMELLinks
Official implementation for the CVPR 2024 paper CAMEL
☆19Updated last year
Alternatives and similar repositories for CAMEL
Users that are interested in CAMEL are comparing it to the libraries listed below
Sorting:
- ☆24Updated 2 months ago
- ☆39Updated last year
- ReNeg: Learning Negative Embedding with Reward Guidance☆32Updated 5 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆25Updated 7 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 7 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22Updated last year
- ☆33Updated 8 months ago
- Video Diffusion State Space Models☆19Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆85Updated 11 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆31Updated 7 months ago
- ☆29Updated last year
- ☆26Updated 3 months ago
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models"☆26Updated 6 months ago
- The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"☆13Updated 4 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆64Updated last year
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated 2 years ago
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆18Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 9 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Updated last year
- ☆19Updated 2 years ago
- ☆95Updated this week
- [CVPR 2025] "DiC: Rethinking Conv3x3 Designs in Diffusion Models", a performant & speedy Conv3x3 diffusion model.☆24Updated 2 weeks ago
- ☆11Updated last year
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆20Updated 3 months ago
- Unified layout planning and image generation☆21Updated 2 months ago
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption 🔍☆42Updated 2 months ago
- Code release for "BoxVIS: Video Instance Segmentation with Box Annotation"☆11Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆30Updated last month
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year