Release of ImageNet-Captions
☆51Jan 20, 2023Updated 3 years ago
Alternatives and similar repositories for imagenet-captions
Users that are interested in imagenet-captions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Mar 21, 2024Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- The download methods of Vision-language Continual Pretraining Dataset P9D.☆12Jan 3, 2025Updated last year
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆49Dec 18, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Mar 1, 2019Updated 7 years ago
- This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".☆46Nov 6, 2022Updated 3 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆12Jan 10, 2023Updated 3 years ago
- Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models☆28Nov 29, 2023Updated 2 years ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- ☆37Oct 7, 2023Updated 2 years ago
- This is a offical PyTorch/GPU implementation of SupMAE.☆79Aug 30, 2022Updated 3 years ago
- This repository provides code for the paper ---- On Universal Black-Box Domain Adaptation.☆10Sep 2, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of "Removing Batch Normalization Boosts Adversarial Training" (ICML'22)☆19Jul 20, 2022Updated 3 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- A simple web app for visualizing the network structure of convolutional neural networks☆12Dec 27, 2017Updated 8 years ago
- ☆18May 25, 2022Updated 3 years ago
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models☆47Sep 25, 2023Updated 2 years ago
- We develop a black-box adversarial attack method against potential deepfake models based on image-to-image translation GANs utilizing 3 o…☆16Sep 14, 2021Updated 4 years ago
- NP DRAW paper released code☆20Mar 18, 2022Updated 4 years ago
- Funny Application of Neural Head Reenactment to Naver Webtoon☆10Mar 22, 2021Updated 5 years ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆151Apr 13, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models" ICLR 2024☆112Jun 11, 2024Updated last year
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆54Oct 20, 2022Updated 3 years ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆61Jun 12, 2023Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆408Nov 10, 2023Updated 2 years ago
- [ECCV 2022] "Adversarial Contrastive Learning via Asymmetric InfoNCE"☆24Dec 12, 2022Updated 3 years ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- Dual-path CNN with Max Gated block for Text-Based Person Re-identification☆10Dec 5, 2020Updated 5 years ago
- Code for "Nearest Neighbor Classifier Embedded Network for Active Learning", AAAI 2021☆10Feb 3, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Feb 27, 2022Updated 4 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆11Nov 29, 2021Updated 4 years ago
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆79Dec 9, 2023Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Create generated datasets and train robust classifiers☆36Sep 1, 2023Updated 2 years ago
- Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".☆13Jun 24, 2024Updated last year