luo3300612 / LaRE
Official code for LaRE2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection. (CVPR 2024)
☆11Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for LaRE
- ☆44Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated last year
- Turning to Video for Transcript Sorting☆46Updated last year
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆25Updated last year
- ☆17Updated 9 months ago
- Official implementation of TagAlign☆32Updated 7 months ago
- [IJCV 2024] Code for DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection☆46Updated 3 weeks ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆18Updated last month
- [NeurIPS 2024] Lumen: a Large multimodal model with versatile vision-centric capabilities☆22Updated last month
- ☆31Updated 5 months ago
- ☆38Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- ☆56Updated 2 years ago
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆17Updated last month
- The collection of awesome papers on alignment of diffusion models.☆45Updated last week
- Teach-DETR: Better Training DETR with Teachers☆29Updated 7 months ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆13Updated last year
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆15Updated last year
- [AAAI 2023] The official implementation of "A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection"☆21Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…☆18Updated last year
- The offical implementation of 'FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant'☆14Updated last month
- An innovative method designed to augment the capabilities of existing video diffusion models☆21Updated 6 months ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆65Updated 8 months ago
- ☆17Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"☆37Updated 6 months ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- ☆14Updated 6 months ago