DRSY / MoTIS

[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)

☆126

Alternatives and similar repositories for MoTIS

Users that are interested in MoTIS are comparing it to the libraries listed below

TheoCoombes / ClipCap
Using pretrained encoder and language models to generate captions from multimedia inputs.
☆97Updated 2 years ago
vladimir-chernykh / coreml-performance
Utility to test the performance of CoreML models.
☆70Updated 5 years ago
fguzman82 / CLIP-Finder2
CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…
☆84Updated last year
ryanwebster90 / snip-dedup
☆103Updated last year
Lednik7 / CLIP-ONNX
It is a simple library to speed up CLIP inference up to 3x (K80 GPU)
☆223Updated 2 years ago
Glovo / foodi-ml-dataset
☆59Updated last year
j-min / CLIP-Caption-Reward
PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
☆246Updated 3 months ago
LAION-AI / General-GPT
☆65Updated 2 years ago
applenob / clip_chinese_text_encoder
CLIP Chinese encoder
☆22Updated 3 years ago
LAION-AI / watermark-detection
A repository containing datasets and tools to train a watermark classifier.
☆71Updated 3 years ago
rom1504 / embedding-reader
Efficiently read embeddings in streaming fashion from any filesystem
☆101Updated last month
salesforce / LayoutDETR
The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer'
☆100Updated 4 months ago
weiyx16 / CLIP-pytorch
A non-JIT implementation / replication of OpenAI's CLIP in PyTorch
☆34Updated 4 years ago
dhansmair / flamingo-mini
Implementation of the DeepMind Flamingo vision-language model, based on Hugging Face language models and ready for training
☆168Updated 2 years ago
iejMac / clip-video-encode
Easily compute CLIP embeddings from video frames
☆146Updated last year
rom1504 / laion-prepro
Get hundreds of millions of image+url pairs from the crawling at home dataset and preprocess them
☆222Updated last year
gregor-ge / mBLIP
☆87Updated last year
cardinalblue / clip-models-for-distillation
☆18Updated 2 years ago
Norod / U-2-Net-StyleTransfer
U-2-Net: U Square Net - Modified for paired image training of style transfer
☆51Updated 3 years ago
xuewyang / Fashion_Captioning
ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.
☆85Updated 2 years ago
Deferf / CLIP_Video_Representation
Use CLIP to represent video for Retrieval Task
☆70Updated 4 years ago
LAION-AI / video-clip
Let's make a video clip
☆95Updated 3 years ago
apple / ml-no-token-left-behind
☆140Updated 2 years ago
minimaxir / stable-diffusion-negative-prompt
Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.
☆87Updated 2 years ago
zsc / llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
☆110Updated 2 years ago
noagarcia / explain-paintings
Repository for the data in the paper "Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation".
☆20Updated 4 years ago
gregor-ge / Babel-ImageNet
☆23Updated last year
dzryk / antarctic-captions
☆112Updated 4 years ago
mlfoundations / imagenet-captions
Release of ImageNet-Captions
☆51Updated 2 years ago
SALT-NLP / LLaVAR
Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"
☆269Updated last year