DRSY / MoTISLinks
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
☆126Updated 2 years ago
Alternatives and similar repositories for MoTIS
Users that are interested in MoTIS are comparing it to the libraries listed below
Sorting:
- Utility to test the performance of CoreML models.☆69Updated 5 years ago
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…☆85Updated last year
- A repository containing datasets and tools to train a watermark classifier.☆72Updated 3 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆97Updated 2 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆246Updated 4 months ago
- U-2-Net: U Square Net - Modified for paired image training of style transfer☆51Updated 3 years ago
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization.☆223Updated 4 years ago
- ALIGN trained on COYO-dataset☆29Updated last year
- Easily compute clip embeddings from video frames☆146Updated last year
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'☆100Updated 5 months ago
- ☆87Updated last year
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆246Updated 9 months ago
- ☆65Updated 2 years ago
- ☆141Updated 2 years ago
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆223Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆102Updated 2 months ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- Let's make a video clip☆95Updated 3 years ago
- ☆103Updated last year
- M4 experiment logbook☆57Updated 2 years ago
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆222Updated last year
- ☆125Updated 2 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- Style Transfer a face into cartoon without GAN. A UNet++ network with MobileNet v3 backbone optimized for mobile frameworks☆30Updated 3 years ago
- This repo provides scripts for converting tensorflow and pytorch models to coreml for variety of tasks. Converted models like efficientDe…☆41Updated 5 years ago
- ☆18Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆167Updated 2 years ago
- ☆60Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆81Updated 2 years ago
- Diffusion-based markup-to-image generation☆83Updated 2 years ago