cure-lab / MMA-Diffusion
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
☆133Updated 5 months ago
Related projects: ⓘ
- ☆99Updated 2 months ago
- Improving fast adversarial training with prior-guided knowledge (TPAMI2024)☆35Updated 5 months ago
- Inference pipeline for some Text-to-Image metrics.☆35Updated 4 months ago
- [ICML22] "Revisiting and Advancing Fast Adversarial Training through the Lens of Bi-level Optimization" by Yihua Zhang*, Guanhua Zhang*, …☆72Updated last year
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆37Updated 4 months ago
- [CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation☆99Updated 6 months ago
- CVPR 2022 Workshop Robust Classification☆94Updated 2 years ago
- ☆353Updated 2 months ago
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆106Updated 5 months ago
- Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging (TIFS2024)☆32Updated 3 months ago
- [ECCV2022,oral] Identifying Hard Noise in Long-Tailed Sample Distribution☆75Updated 2 years ago
- This is the official reproduction of Qihoo-T2X.☆75Updated last week
- ☆76Updated 2 months ago
- [ICLR 2023] Official Tensorflow implementation of "Distributionally Robust Post-hoc Classifiers under Prior Shifts"☆39Updated 7 months ago
- Attack classification models with transferability, black-box attack; unrestricted adversarial attacks on imagenet, CVPR2021 安全AI挑战者计划第六期:…☆54Updated 3 years ago
- An unofficial implementation of the paper "TopNet: Transformer-based Object Placement Network for Image Compositing", CVPR 2023.☆20Updated 3 months ago
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C…☆144Updated 2 weeks ago
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆39Updated 7 months ago
- Visualization of DiT self attention features☆142Updated last month
- [NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Min…☆141Updated last year
- ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation☆166Updated 3 weeks ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆77Updated 3 months ago
- Quick scripts to calculate CLIP text-image similarity☆171Updated last year
- [CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video …☆112Updated 3 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆46Updated last month
- Code for paper:An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks☆58Updated 3 weeks ago
- This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral☆445Updated last month
- ☆17Updated 8 months ago
- Evaluating text-to-image/video/3D models with VQAScore☆169Updated last week
- Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distributionDetection: Masked Image Modeling Is All You Ne…☆133Updated 2 months ago