james-oldfield / PoS-subspaces
[NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models
☆27Updated 11 months ago
Alternatives and similar repositories for PoS-subspaces:
Users who are interested in PoS-subspaces are comparing it to the repositories listed below.
- [ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs"☆58Updated last year
- ☆50Updated 2 years ago
- Code and instructions accompanying the ICCV'23 paper "Prototype-based Dataset Comparison"☆17Updated last year
- This is an official PyTorch/GPU implementation of SupMAE.☆77Updated 2 years ago
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆46Updated 3 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆37Updated 2 years ago
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆29Updated 4 months ago
- PyTorch implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Updated 2 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆41Updated 2 years ago
- Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementati…☆34Updated 2 years ago
- ☆46Updated last year
- ☆34Updated last year
- [ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year
- ☆84Updated 2 years ago
- PyTorch implementation of PermutedAdaIN☆37Updated 3 years ago
- Compress conventional Vision-Language Pre-training data☆49Updated last year
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Updated 2 years ago
- ☆31Updated 3 years ago
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆107Updated last year
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 2 years ago
- ☆35Updated 8 months ago
- ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…☆24Updated 3 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆38Updated 2 years ago
- Repository for the paper "Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment" (AAAI'24 Oral)☆25Updated 9 months ago
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆30Updated 6 months ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆19Updated 3 months ago
- ☆53Updated 2 years ago
- ☆26Updated last year
- PHASE annotations for societal bias in vision-and-language tasks.☆16Updated 8 months ago