KimRass / CLIPLinks
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
☆12Updated last year
Alternatives and similar repositories for CLIP
Users that are interested in CLIP are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…☆17Updated last year
- This repo contains a PyTorch implementation of a CNN model for multi-label Image classification model deployed on heroku.☆14Updated 4 years ago
- 2019CCF爱奇艺视频拷贝(版权)检测算法☆15Updated 5 years ago
- The Hybrid Fake Face (HFF) dataset is built by exploiting the PGGAN, StyleGAN, Glow, and StarGAN.☆17Updated last year
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Updated last year
- ☆18Updated 2 years ago
- conditional U-Net Pytorch Implementation☆11Updated 4 years ago
- Face Matching Repository☆11Updated 3 years ago
- A collection of easy to understand diffusion model implementations in Pytorch.☆11Updated 2 years ago
- Download flickr8k, flickr30k image caption datasets☆30Updated last year
- ☆13Updated 2 years ago
- Image/Instance Retrieval using CLIP, A self supervised Learning Model☆29Updated 2 years ago
- custom pytorch implementation of MoCo v3☆46Updated 4 years ago
- CVPR2023 paper☆52Updated 2 years ago
- demo natural language video db using CLIP☆27Updated last year
- Non-local Modeling for Image Quality Assessment☆13Updated last year
- Custom Iterable Dataset Class for Large-Scale Data Loading☆14Updated 3 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Updated last year
- Code for the paper "Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection"☆14Updated 4 years ago
- Masked Vision-Language Transformer in Fashion☆36Updated 2 years ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training☆22Updated 2 years ago
- ☆29Updated 3 years ago
- Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》☆14Updated 9 months ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 4 years ago
- 40 face attributes prediction on CelebA benchmark with PyTorch Implementation.☆21Updated 3 years ago
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)☆80Updated 4 years ago
- A fine tune version of Stable Diffusion model on self-translate 10k diffusiondb Chinese Corpus and "extend" it☆32Updated 2 years ago
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge☆46Updated last year