awsaf49 / flickr-dataset
Download flickr8k, flickr30k image caption datasets
☆13Updated last year
Alternatives and similar repositories for flickr-dataset:
Users that are interested in flickr-dataset are comparing it to the libraries listed below
- Deploy Swin Transformer using TorchServe☆27Updated 3 years ago
- Task Agnostic Unsupervised Learning☆15Updated 3 years ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 2 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 2 years ago
- TensorFlow implementation of GhostNet: More Features from Cheap Operations.☆10Updated 5 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- ☆17Updated 4 years ago
- Three experiments for data efficient video transformers.☆9Updated 2 years ago
- Code for reproducing IS-Count: Large-scale Object Counting with Importance Sampling (AAAI 2022)☆26Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Updated 3 years ago
- Few training heuristics and small architectural changes that can significantly improve YOLOv3 performance with tiny increase in inference…☆12Updated 4 years ago
- ECCV 2022 Workshop: AI-enabled Medical Image Analysis – Digital Pathology & Radiology/COVID19 : An easy-to-understand and lightweight Tra…☆10Updated 7 months ago
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated last year
- SAM-CLIP module for use with Autodistill.☆13Updated last year
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Updated 2 years ago
- survery of small language models☆14Updated 6 months ago
- Multi-label classification based on timm, and add SimCLR to timm.☆37Updated 3 years ago
- image captioning paper list☆8Updated 5 years ago
- Official PyTorch code for HILA☆28Updated 2 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Updated 3 years ago
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆21Updated 2 years ago
- This repository including most of cnn visualizations techniques using pytorch☆14Updated 4 years ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆17Updated 4 months ago
- ViT trained on COYO-Labeled-300M dataset☆31Updated 2 years ago
- TF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).☆52Updated 3 years ago
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Updated last year
- Bag of MLP☆20Updated 3 years ago
- A Framework for Real-time Object Detection and Image Restoration☆18Updated 4 months ago
- Implementation for NATv2.☆23Updated 4 years ago
- Tiny ResNet inspired FPN network (<2M params) for Rotated Object Detection using 5-parameter Modulated Rotation Loss☆18Updated 3 years ago