Davidelanz / pytorch-hedLinks
Python Package reimplementation of Holistically-Nested Edge Detection in PyTorch
☆12Updated 5 years ago
Alternatives and similar repositories for pytorch-hed
Users that are interested in pytorch-hed are comparing it to the libraries listed below
Sorting:
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆141Updated last month
- PyTorch code for MUST☆108Updated 9 months ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆414Updated 6 months ago
- ImageNet-Sketch data set for evaluating model's ability in learning (out-of-domain) semantics at ImageNet scale☆230Updated 3 years ago
- Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"☆113Updated 5 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆318Updated 2 years ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆188Updated 7 months ago
- Official repository for "Revisiting Weakly Supervised Pre-Training of Visual Perception Models". https://arxiv.org/abs/2201.08371.☆182Updated 3 years ago
- Release of ImageNet-Captions☆51Updated 3 years ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet☆224Updated 3 years ago
- ☆249Updated 3 years ago
- A PyTorch implementation of VIOLET☆140Updated 2 years ago
- A large-scale dataset for instance-level recognition for artworks is introduced.☆51Updated 2 years ago
- A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".☆84Updated last year
- Learning to Count without Annotations☆23Updated last year
- ☆65Updated 2 years ago
- [CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"☆405Updated 2 years ago
- Reliably download millions of images efficiently☆118Updated 4 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.☆134Updated 3 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380Updated 3 years ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆320Updated last year
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆223Updated last year
- CLIPScore EMNLP code☆244Updated 3 years ago
- ☆13Updated 3 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆71Updated 4 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions☆55Updated last year
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort…☆135Updated 10 months ago
- Analyzing basic network responses to novel classes☆41Updated 4 years ago
- [CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning☆208Updated 3 years ago
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training☆281Updated 2 years ago