yshinya6 / clip-refineView external linksLinks
Code repository for "Post-pre-training for Modality Alignment in Vision-Language Foundation Models" (CVPR2025)
☆38Jul 25, 2025Updated 6 months ago
Alternatives and similar repositories for clip-refine
Users that are interested in clip-refine are comparing it to the libraries listed below
Sorting:
- Papers from the intersection of surgery and data science / machine learning☆15Jan 28, 2024Updated 2 years ago
- Implementation of the "Learn No to Say Yes Better" paper.☆39Oct 30, 2025Updated 3 months ago
- Official code of the paper "EgoExOR: An Egocentric–Exocentric Operating Room Dataset for Comprehensive Understanding of Surgical Activiti…☆24Jan 21, 2026Updated 3 weeks ago
- SotA text-only image/video method (IJCAI 2023)☆16Jan 9, 2024Updated 2 years ago
- AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)☆56Mar 1, 2025Updated 11 months ago
- ☆34Jul 8, 2025Updated 7 months ago
- [ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"☆21Oct 23, 2024Updated last year
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆75May 23, 2025Updated 8 months ago
- SurgLaVi: Large-Scale Hierarchical Datasets for Surgical Vision–Language Representation Learning☆23Feb 2, 2026Updated last week
- ☆21May 18, 2025Updated 8 months ago
- [MedIA 2025] - Official repo for the paper: "Scaling up self-supervised learning for improved surgical foundation models"☆49Nov 25, 2025Updated 2 months ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 6 years ago
- Official repository of the GraSP dataset and implemention of TAPIS☆50Dec 31, 2024Updated last year
- A PyTorch tool kit for estimating the circular content area in endoscopic footage.☆11Nov 7, 2024Updated last year
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 8 months ago
- 使用DDPG算法解决rlschool中无人机悬停控制的问题(内含训练了9个小时的良模型)☆10Jul 7, 2020Updated 5 years ago
- Large-scale Self-supervised Pre-training for Endoscopy☆44Jun 11, 2024Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Oct 14, 2024Updated last year
- Cover learning with geometric optimization☆12Sep 21, 2025Updated 4 months ago
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆32Jan 30, 2026Updated 2 weeks ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- Official code of the paper "A Stealthy Wrongdoer: Feature-Oriented Reconstruction Attack against Split Learning".☆15Sep 11, 2024Updated last year
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 7 months ago
- The Third Place Winner in Generative Track of the ECCV 2024 DD Challenge☆10Oct 11, 2024Updated last year
- ☆11Sep 30, 2024Updated last year
- GAN you see me? enhanced data reconstruction attacks against split inference - NeurIPS 2023☆12Mar 26, 2025Updated 10 months ago
- Revisiting K-Net for Real-Time Panoptic Segmentation. Code release for our IV 2023 paper.☆14Apr 25, 2025Updated 9 months ago
- PyTorch implementation of DINO (Self-Distillation with No Labels) from scratch.☆18May 13, 2025Updated 9 months ago
- This repository contains the python scripts developed as a part of the work presented in the paper "STAnet: A Spatiotemporal Attention Ne…☆15May 10, 2023Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Official implementation of "Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions"…☆14Jun 16, 2024Updated last year
- ☆13Jan 15, 2024Updated 2 years ago
- Reimplementation of paper "Image Super-Resolution by Neural Texture Transfer" in CVPR 2019 by pytorch.☆12Oct 22, 2019Updated 6 years ago
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning☆10Oct 22, 2022Updated 3 years ago
- ☆10Nov 27, 2024Updated last year
- [ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"☆14Feb 15, 2025Updated last year