KimRass / CLIPLinks
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
☆11Updated last year
Alternatives and similar repositories for CLIP
Users that are interested in CLIP are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…☆18Updated 2 years ago
- The Hybrid Fake Face (HFF) dataset is built by exploiting the PGGAN, StyleGAN, Glow, and StarGAN.☆18Updated last year
- conditional U-Net Pytorch Implementation☆11Updated 4 years ago
- A collection of easy to understand diffusion model implementations in Pytorch.☆10Updated 3 years ago
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Updated last year
- 40 face attributes prediction on CelebA benchmark with PyTorch Implementation.☆22Updated 4 years ago
- Face Matching Repository☆12Updated 3 years ago
- ☆13Updated 2 years ago
- Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》☆15Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Updated 2 years ago
- This repo contains a PyTorch implementation of a CNN model for multi-label Image classification model deployed on heroku.☆14Updated 4 years ago
- ☆48Updated 4 years ago
- demo natural language video db using CLIP☆28Updated last year
- custom pytorch implementation of MoCo v3☆46Updated 4 years ago
- Easily compute model embeddings and save the embeddings.☆10Updated 3 years ago
- 2019CCF爱奇艺视频拷贝(版权)检测算法☆15Updated 6 years ago
- This script is used to augment image data created using LabelMe-MIT.☆12Updated 4 years ago
- CVPR2023 paper☆52Updated 2 years ago
- ☆18Updated 2 years ago
- We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).☆19Updated last year
- 使用Qwen3的Embedding和Reranker模型实现查找与精排☆20Updated 7 months ago
- ☆20Updated 2 years ago
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge☆45Updated last year
- PyTorch implementation for our paper EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels☆28Updated 5 years ago
- Fashion 200K dataset used in paper "Automatic Spatially-aware Fashion Concept Discovery."☆68Updated 3 years ago
- Hegemony / Design-and-Implementation-of-Emotional-Face-Generation-Based-on-Generative-Adversarial-Networks☆13Updated 5 years ago
- Custom Iterable Dataset Class for Large-Scale Data Loading☆14Updated 4 years ago
- A image caption dataset about images from www.dpchallenge.com.☆20Updated 6 years ago
- Local Discriminative Regions for Scene Recognition (ACMMM 2018)☆22Updated 2 years ago
- Official repo for Directional Self-supervised Learning for Heavy Image Augmentations [CVPR2022]☆12Updated 3 years ago