xiaoshutongly / clip-lora
☆13Updated last year
Related projects: ⓘ
- A paper list of some recent works about Token Compress for Vit and VLM☆32Updated last week
- Official implementation of TagAlign☆31Updated 5 months ago
- OpenCompatible provides a standard compatible training benchmark, covering practical training scenarios.☆24Updated 2 years ago
- IJCV22 Attack your retrieval model via Query! They are not robust as you expected!☆47Updated last year
- This repo holds the competitions (information, solutions, summaries, memories) that our team has participated in☆25Updated 7 months ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated last year
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 3 years ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆89Updated last year
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆22Updated 2 years ago
- ☆100Updated 7 months ago
- ChineseCLIP using online learning☆12Updated last year
- ☆23Updated last year
- ☆27Updated 2 years ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model☆13Updated 7 months ago
- Implementation for Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification☆51Updated 2 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Updated last year
- ☆44Updated last year
- ☆34Updated 2 years ago
- ☆24Updated last year
- [ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching☆69Updated 9 months ago
- ☆17Updated this week
- [AAAI 2023] The official implementation of "A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection"☆21Updated last year
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated last year
- Turning to Video for Transcript Sorting☆44Updated last year
- ☆9Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆111Updated 2 months ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆91Updated 7 months ago
- ☆83Updated 9 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆32Updated 7 months ago
- Large Multimodal Model☆15Updated 5 months ago