DRSY / MoTISLinks
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
☆124Updated 2 years ago
Alternatives and similar repositories for MoTIS
Users that are interested in MoTIS are comparing it to the libraries listed below
Sorting:
- Utility to test the performance of CoreML models.☆70Updated 5 years ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆97Updated 2 years ago
- OpenAI CLIP coreML version for iOS text-image embeddings, image search, image clustering, image classifiy☆19Updated 2 years ago
- Easily compute clip embeddings from video frames☆145Updated last year
- Efficiently read embedding in streaming from any filesystem☆100Updated last year
- ☆60Updated last year
- ALIGN trained on COYO-dataset☆29Updated last year
- ☆86Updated last year
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆167Updated 2 years ago
- ☆19Updated last year
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)☆243Updated last month
- A repository containing datasets and tools to train a watermark classifier.☆70Updated 3 years ago
- This repo provides scripts for converting tensorflow and pytorch models to coreml for variety of tasks. Converted models like efficientDe…☆39Updated 5 years ago
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…☆80Updated 11 months ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆85Updated 2 years ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 4 years ago
- codebase for the SIMAT dataset and evaluation☆38Updated 3 years ago
- ☆64Updated last year
- Let's make a video clip☆96Updated 2 years ago
- CLIP中文encoder☆22Updated 3 years ago
- ☆141Updated 2 years ago
- ☆23Updated last year
- ☆46Updated 3 years ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆21Updated last year
- Get hundred of million of image+url from the crawling at home dataset and preprocess them☆220Updated last year
- Repository for the data in the paper "Explain Me the Painting: Multi-TopicKnowledgeable Art Description Generation".☆20Updated 3 years ago
- This is a summary of easily available datasets for generalized DALLE-pytorch training.☆128Updated 3 years ago
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆138Updated 2 years ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Updated 2 years ago