bubbliiiing / clip-pytorch
这是一个clip-pytorch的模型,可以训练自己的数据集。
☆206Updated last year
Alternatives and similar repositories for clip-pytorch:
Users that are interested in clip-pytorch are comparing it to the libraries listed below
- A deep learning code base, mainly for paper replication, in the areas of image recognition, object detection, image segmentation, self-su…☆329Updated 2 years ago
- ☆170Updated 3 months ago
- a super easy clip model with mnist dataset for study☆88Updated 10 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆94Updated last year
- Awesome Fine-grained Visual Classification☆218Updated last year
- image classifier implement in pytoch.☆108Updated last year
- Centralized Feature Pyramid for Object Detection☆246Updated last year
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆518Updated last year
- TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction☆330Updated 3 weeks ago
- 这是一个DETR-pytorch的仓库,可以训练自己的数据集☆177Updated last year
- 这是各个主干网络分类模型的源码,可以用于训练自己的分类模型。☆412Updated 2 years ago
- [CVPR 2024] Code release for TransNeXt model☆470Updated 7 months ago
- ☆204Updated 4 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆170Updated last year
- 这是一个segformer-pytorch的源码,可以用于训练自己的模型。☆312Updated last year
- 基于Swin-transformer训练图像分类并部署web端☆85Updated 2 years ago
- 深度学习/计算机视觉/多模态/机器学习/人工智能零基础理论/实战教程汇总分享☆134Updated 2 years ago
- This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆197Updated last year
- 利用Segment Anything(SAM)模型进行快速标注☆212Updated last month
- ☆122Updated 4 months ago
- ☆118Updated 4 months ago
- deep learning for image processing including classification and object-detection etc.☆24Updated 3 years ago
- ☆97Updated 9 months ago
- 这是一个blip-pytorch简化的代码,适用于了解Attention与Transformer的结构。☆45Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆183Updated 5 months ago
- A tutorial of finetune segment anything used VOC2007☆24Updated 5 months ago
- GroupMixAttention and GroupMixFormer☆114Updated last year
- 多模态 MM +Chat 合集☆238Updated last week
- ☆99Updated last year
- RAFConv: Innovating Spatital Attention and Standard Convolutional Operation☆156Updated 5 months ago