This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.
☆71 · Dec 20, 2021 · Updated 4 years ago
Alternatives and similar repositories for OTTER
Users that are interested in OTTER are comparing it to the libraries listed below.
- Query Learning of Both Thing and Stuff for Panoptic Segmentation (ICIP 2022) ☆15 · Sep 3, 2022 · Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms ☆11 · Nov 29, 2021 · Updated 4 years ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023) ☆143 · Jun 10, 2025 · Updated 9 months ago
- Paper List for In-context Learning 🌷 ☆19 · Jan 3, 2023 · Updated 3 years ago
- Code release for SLIP: Self-supervision meets Language-Image Pre-training ☆787 · Feb 9, 2023 · Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 · Sep 10, 2022 · Updated 3 years ago
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer ☆63 · Apr 20, 2023 · Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Sep 17, 2021 · Updated 4 years ago
- ☆19 · May 27, 2023 · Updated 2 years ago
- The official source code for the paper Consensus-Aware Visual-Semantic Embedding for Image-Text Matching (ECCV 2020) ☆168 · Feb 7, 2022 · Updated 4 years ago
- [NeurIPS 2022] code for "K-LITE: Learning Transferable Visual Models with External Knowledge" https://arxiv.org/abs/2204.09222 ☆53 · Jun 12, 2023 · Updated 2 years ago
- Official repository for Fourier model that can generate periodic signals ☆10 · Mar 10, 2022 · Updated 4 years ago
- L-Verse: Bidirectional Generation Between Image and Text ☆107 · Apr 1, 2025 · Updated 11 months ago
- ☆43 · Aug 9, 2022 · Updated 3 years ago
- ☆11 · Jan 18, 2024 · Updated 2 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022 ☆94 · Jan 16, 2024 · Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers ☆31 · Mar 18, 2024 · Updated 2 years ago
- ☆40 · Jan 12, 2021 · Updated 5 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting". ☆317 · Aug 7, 2023 · Updated 2 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022) ☆237 · Aug 3, 2022 · Updated 3 years ago
- ECCV 2022, Bootstrapped Masked Autoencoders for Vision BERT Pretraining ☆97 · Nov 2, 2022 · Updated 3 years ago
- Spectral Graph Attention Network with Fast Eigen-approximation ☆12 · Dec 24, 2021 · Updated 4 years ago
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows. ☆67 · Oct 11, 2022 · Updated 3 years ago
- ☆22 · May 4, 2023 · Updated 2 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm ☆677 · Sep 19, 2022 · Updated 3 years ago
- PyTorch implementation of XMC-GAN ☆11 · Jun 2, 2021 · Updated 4 years ago
- ☆188 · Nov 7, 2022 · Updated 3 years ago
- ☆14 · Feb 3, 2026 · Updated last month
- Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021] ☆89 · Oct 2, 2021 · Updated 4 years ago
- Collections of self-supervised methods, based on cvpods. ☆57 · Aug 21, 2021 · Updated 4 years ago
- code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022 ☆268 · Oct 2, 2024 · Updated last year
- Localized Vision-Language Matching for Open-vocabulary Object Detection ☆22 · Aug 11, 2022 · Updated 3 years ago
- Featurized Query R-CNN ☆45 · Jun 17, 2022 · Updated 3 years ago
- [NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning ☆177 · May 15, 2025 · Updated 10 months ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers ☆26 · Apr 12, 2022 · Updated 3 years ago
- [CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning ☆33 · May 25, 2025 · Updated 10 months ago
- Pixel-ImageNet ☆45 · Feb 24, 2022 · Updated 4 years ago
- Authors' official PyTorch implementation of "ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences". ☆42 · Oct 1, 2022 · Updated 3 years ago
- ☆35 · May 2, 2022 · Updated 3 years ago