Download methods for P9D, a vision-language continual pretraining dataset.
☆12 Jan 3, 2025 Updated last year
Alternatives and similar repositories for P9D
Users who are interested in P9D are comparing it to the repositories listed below
- [ICCV2023] - CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ☆38 Oct 8, 2024 Updated last year
- [ToMM2023] - AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval ☆20 Aug 30, 2024 Updated last year
- [TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval ☆23 Aug 30, 2024 Updated last year
- [NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning. ☆14 Dec 12, 2024 Updated last year
- Multimodal-Composite-Editing-and-Retrieval-update ☆35 Oct 13, 2025 Updated 5 months ago
- ☆12 Oct 24, 2023 Updated 2 years ago
- C2P-CLIP-DeepfakeDetection ☆94 Dec 26, 2025 Updated 2 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation ☆89 Sep 7, 2023 Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆25 Nov 23, 2024 Updated last year
- Dataset for GAN-Generated Images Detection ☆10 Apr 25, 2024 Updated last year
- A PyTorch Implementation of MLS-MPM (Moving Least Squares Material Point Method) ☆24 Mar 20, 2025 Updated last year
- ☆11 Apr 18, 2021 Updated 4 years ago
- ☆28 Feb 2, 2026 Updated last month
- Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection ☆17 Mar 19, 2024 Updated 2 years ago
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift ☆38 Jan 25, 2024 Updated 2 years ago
- ☆18 Jun 10, 2023 Updated 2 years ago
- official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral) ☆11 Jun 16, 2022 Updated 3 years ago
- [2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing ☆14 Nov 17, 2024 Updated last year
- Towards Long Form Audio-visual Video Understanding ☆15 Jan 16, 2026 Updated 2 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models ☆19 Mar 25, 2025 Updated 11 months ago
- This repository is the official implementation of StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model ☆20 Jul 30, 2024 Updated last year
- Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser" ☆17 Jul 13, 2025 Updated 8 months ago
- ☆19 Jan 3, 2026 Updated 2 months ago
- Release of ImageNet-Captions ☆51 Jan 20, 2023 Updated 3 years ago
- ☆10 Sep 13, 2022 Updated 3 years ago
- Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency ☆17 Dec 2, 2021 Updated 4 years ago
- ☆15 Mar 31, 2022 Updated 3 years ago
- ☆11 Jan 2, 2026 Updated 2 months ago
- ☆13 Jun 17, 2024 Updated last year
- Store articles for WeChat Public 'CVDaily' ☆11 Feb 7, 2018 Updated 8 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning" ☆15 Nov 25, 2025 Updated 3 months ago
- [CVPR 2025] Six-CD: Benchmarking Concept Removals for Benign Text-to-image Diffusion Models ☆16 Jan 8, 2026 Updated 2 months ago
- [NeurIPS 2024] SeeClear: This repo is the official implementation of "SeeClear: Semantic Distillation Enhances Pixel Condensation for Vid… ☆18 Oct 8, 2024 Updated last year
- Unified Audio-Visual Perception for Multi-Task Video Localization ☆31 Apr 19, 2024 Updated last year
- Conformal Prediction + Federated Learning ☆15 Mar 16, 2024 Updated 2 years ago
- N-body simulation based on CUDA. ☆14 Jun 20, 2019 Updated 6 years ago
- Federated Conformal Prediction with Quantile-of-Quantiles (FedCP-QQ) ☆11 Aug 16, 2023 Updated 2 years ago
- [CVPR 2022] DiSparse: Disentangled Sparsification for Multitask Model Compression ☆14 Sep 6, 2022 Updated 3 years ago
- ☆14 Aug 7, 2019 Updated 6 years ago