IIGROUP/SCL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IIGROUP/SCL)

IIGROUP / SCL

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning

☆20

Alternatives and similar repositories for SCL

Users that are interested in SCL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZhangXu0963 / VSL
View on GitHub
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15Dec 25, 2023Updated 2 years ago
quicksviewer / quicksviewer
View on GitHub
☆19Jun 29, 2025Updated last year
e-bug / fine-grained-evals
View on GitHub
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 3 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
jpthu17 / DiCoSA
View on GitHub
[IJCAI 2023] Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
☆53Apr 9, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
YYJMJC / LOUPE
View on GitHub
☆45Aug 14, 2023Updated 2 years ago
QiuHeqian / mmdetection-ref
View on GitHub
☆10Jan 9, 2025Updated last year
yj-yu / CiSIN
View on GitHub
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Jan 17, 2021Updated 5 years ago
guilk / VLC
View on GitHub
Research code for "Training Vision-Language Transformers from Captions Alone"
☆33Jul 15, 2022Updated 4 years ago
LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 3 years ago
facebookresearch / diht
View on GitHub
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training
☆141Dec 16, 2025Updated 7 months ago
facebookresearch / reliable_vqa
View on GitHub
Implementation for the paper "Reliable Visual Question Answering Abstain Rather Than Answer Incorrectly" (ECCV 2022: https//arxiv.org/abs…
☆41May 19, 2023Updated 3 years ago
zipengxuc / SpectralCLIP
View on GitHub
Code for WACV 2024 paper ✨ "SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective".
☆19Nov 4, 2023Updated 2 years ago
FudanDISC / DISCOpen-MVPTR
View on GitHub
pytorch implementation of mvp: a multi-stage vision-language pre-training framework
☆11Apr 23, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NjtechCVLab / RSTPReid-Dataset
View on GitHub
RSTPReid Dataset for Text-based Person Retrieval.
☆35Sep 2, 2022Updated 3 years ago
kaiw7 / STG-CMA
View on GitHub
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15Apr 13, 2024Updated 2 years ago
xbdxwyh / mocose
View on GitHub
☆11Feb 14, 2023Updated 3 years ago
VL-Group / 2022-NeurIPS-DAA
View on GitHub
The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…
☆19Jan 16, 2024Updated 2 years ago
anse3832 / IQT
View on GitHub
Unofficial implementation of CVPR2021 paper "Perceptual Image Quality Assessment with Transformers"
☆76Oct 21, 2021Updated 4 years ago
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
hustvl / RND-SCI
View on GitHub
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging
☆11May 18, 2023Updated 3 years ago
jiyt17 / IDA-VLM
View on GitHub
[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
☆37Nov 27, 2024Updated last year
skhcjh231 / MATR_codebase
View on GitHub
☆22Mar 7, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
leolee99 / PAU
View on GitHub
[NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…
☆28May 14, 2024Updated 2 years ago
wxy11-27 / GMSR
View on GitHub
☆23Nov 26, 2024Updated last year
gabegrand / adversarial-vqa
View on GitHub
☆12Aug 14, 2019Updated 6 years ago
koc-lab / FrFNet
View on GitHub
The repository of the Fractional Fourier Transform Meets Transformer Encoder paper in IEEE Signal Processing Letters
☆10Oct 31, 2022Updated 3 years ago
ZYH-Lightyear / LVAS
View on GitHub
LVAS-Agent Code Base
☆21Apr 15, 2025Updated last year
YooSungHyun / attention-time-forecast
View on GitHub
attention으로 시계열 예측은 할 수 없을까
☆10Apr 30, 2021Updated 5 years ago
SaFo-Lab / seclaw
View on GitHub
🦾 SeClaw: The Security Armored Personal AI Assistant
☆31Mar 18, 2026Updated 4 months ago
heliossun / SQ-LLaVA
View on GitHub
Visual self-questioning for large vision-language assistant.
☆44Jul 23, 2025Updated 11 months ago
IIGROUP / PUM
View on GitHub
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19May 7, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
hasakiXie123 / FedCMR
View on GitHub
FedCMR: Federated Cross-Modal Retrieval 的代码(the official implementation of FedCMR: Federated Cross-Modal Retrieval)
☆17Oct 17, 2025Updated 9 months ago
rabiulcste / vqazero
View on GitHub
visual question answering prompting recipes for large vision-language models
☆29Sep 14, 2024Updated last year
BasicCoder / SketchClassification
View on GitHub
Pytorch Sketch Classification
☆11Apr 14, 2018Updated 8 years ago
LooperXX / ManagerTower
View on GitHub
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
☆12Aug 23, 2025Updated 10 months ago
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago
littlexinyi / MGCC
View on GitHub
The code of MGCC: Text-based Occluded Person Re-identification via Multi-Granularity Contrastive Consistency Learning
☆20Feb 26, 2025Updated last year
jiyounglee-0523 / FourierDecoder
View on GitHub
Official repository for Fourier model that can generate periodic signals
☆10Mar 10, 2022Updated 4 years ago