bubbliiiing / clip-pytorch
This is a clip-pytorch model that can be trained on your own dataset.
☆228Updated 2 years ago
Alternatives and similar repositories for clip-pytorch
Users that are interested in clip-pytorch are comparing it to the libraries listed below
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆92Updated 2 years ago
- TPAMI: Frequency-aware Feature Fusion for Dense Image Prediction☆401Updated 3 months ago
- A deep learning code base, mainly for paper replication, in the areas of image recognition, object detection, image segmentation, self-su…☆340Updated 2 years ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆297Updated last month
- ☆150Updated last year
- This is simplified blip-pytorch code, suitable for understanding the structure of Attention and Transformer.☆51Updated last year
- Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense…☆299Updated 4 months ago
- GroupMixAttention and GroupMixFormer☆116Updated last year
- [TNNLS 2025] TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition☆201Updated last month
- ☆224Updated 2 months ago
- A super easy CLIP model with the MNIST dataset for study☆117Updated last year
- [CVPR 2024] Code release for TransNeXt model☆526Updated 11 months ago
- Computer vision course project: an image-text retrieval system based on Chinese-CLIP☆66Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆209Updated last year
- [CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels☆224Updated last week
- Awesome Fine-grained Visual Classification☆231Updated last year
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆310Updated 2 months ago
- Official implementation of SCTNet (AAAI2024)☆275Updated last year
- The official DETR source code with Chinese annotations!☆75Updated 2 years ago
- (CVPR2024)RMT: Retentive Networks Meet Vision Transformer☆349Updated 10 months ago
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆545Updated 2 years ago
- [CVPR 2024] Official implementation of the paper "Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Ref…☆195Updated last month
- ☆254Updated last year
- Implementation code for the ICASSP 2023 paper "Efficient Multi-Scale Attention Module with Cross-Spatial Learning" and is available at:…☆245Updated 5 months ago
- ☆618Updated last year
- [TNNLS] A Comprehensive Survey of Awesome Visual Transformer Literatures.☆259Updated 2 years ago
- [ICLR 2023 & IJCV 2025] SeaFormer: Squeeze-enhanced Axial Transformer☆325Updated 3 months ago
- Train image classification with Swin-transformer and deploy it to the web☆93Updated 2 years ago
- Contains the Mamba code and the corresponding explanation videos on Bilibili☆86Updated last year
- Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval (CVPR 2023)☆235Updated 2 months ago