awsaf49 / flickr-dataset
Download the Flickr8k and Flickr30k image-caption datasets
☆16Updated last year
Alternatives and similar repositories for flickr-dataset:
Users who are interested in flickr-dataset are comparing it to the libraries listed below
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆21Updated 7 months ago
- A simple PyTorch implementation of a diffusion model☆13Updated 2 years ago
- A Framework for Real-time Object Detection and Image Restoration☆18Updated 6 months ago
- Task Agnostic Unsupervised Learning☆15Updated 3 years ago
- TRT for WSOL☆29Updated last year
- TensorFlow implementation of GhostNet: More Features from Cheap Operations.☆10Updated 5 years ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆33Updated 3 years ago
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆11Updated last year
- ☆12Updated 2 years ago
- Frontiers in Neuroinformatics 2022: Local Label Point Correction for Edge Detection of Overlapping Cervical Cells☆29Updated 10 months ago
- This is the repository for the corresponding 2022 MICCAI-MILLanD workshop paper "BoxShrink: From Bounding Boxes to Segmentation Masks"☆22Updated last year
- Fine Grained Visual Classification☆9Updated 3 years ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆17Updated 9 months ago
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Updated 2 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- Deploy Swin Transformer using TorchServe☆27Updated 3 years ago
- Official PyTorch code for HILA☆28Updated 2 years ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆27Updated last year
- [ECCV2022] Revisiting the Critical Factors of Augmentation-Invariant Representation Learning☆12Updated 2 years ago
- ☆20Updated 4 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆22Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- An implementation of "Group Fisher Pruning for Practical Network Compression" based on PyTorch and mmcv☆18Updated 3 years ago
- [ICME 2022] Code for the paper "SimViT: Exploring a Simple Vision Transformer with Sliding Windows"☆68Updated 2 years ago
- ViT trained on COYO-Labeled-300M dataset☆32Updated 2 years ago
- EfficientViT is a new family of vision models for efficient high-resolution vision.☆23Updated last year
- ☆15Updated 2 years ago
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Updated 3 years ago