google-research/silc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research/silc)

google-research / silc

[ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation

☆48

Alternatives and similar repositories for silc

Users that are interested in silc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SimarKareer / UnifiedVideoDA
View on GitHub
We're Not Using Videos Effectively (TMLR 2024)
☆17Feb 4, 2024Updated 2 years ago
CVMI-Lab / clip-beyond-tail
View on GitHub
(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
☆27Oct 28, 2024Updated last year
WangFei-2019 / SNARE
View on GitHub
Project for SNARE benchmark
☆11Jun 5, 2024Updated 2 years ago
kawi19 / CFM
View on GitHub
Repository of CFM: Language-aligned Concept Foundation Model for Vision
☆21Apr 27, 2026Updated 2 months ago
brendel-group / clip-ood
View on GitHub
Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)
☆11Aug 26, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ExplainableML / cosmos
View on GitHub
[CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
☆42Mar 27, 2025Updated last year
m-parchami / FaCT
View on GitHub
Official code for FaCT: Faithful Concept Traces for Explaining Neural Network Decisions. NeurIPS 2025
☆20Mar 6, 2026Updated 4 months ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
Victorwz / MLM_Filter
View on GitHub
Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".
☆71Apr 14, 2025Updated last year
ugorsahin / Generative-Negative-Mining
View on GitHub
[WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024
☆13Jan 3, 2024Updated 2 years ago
james-oldfield / muMoE
View on GitHub
[NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
☆41Sep 30, 2024Updated last year
HUANGLIZI / MGLL
View on GitHub
[ICLR 2026] This repository is the official implementation of the paper “Boosting Medial Visual Understanding From Multi-Granular Languag…
☆25Jun 2, 2026Updated last month
haoyu-bu / CAFe
View on GitHub
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33Mar 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
suryatejreddy / Memeify
View on GitHub
Code and Dataset for Memeify: A Large-scale Meme Generation System
☆25May 21, 2020Updated 6 years ago
boschresearch / GridSaliency-ToyDatasetGen
View on GitHub
Code for toy dataset generation of "Grid Saliency for Context Explanations of Semantic Segmentation" (https://arxiv.org/abs/1907.13054)
☆12Nov 28, 2019Updated 6 years ago
facebookresearch / DCI
View on GitHub
Densely Captioned Images (DCI) dataset repository.
☆197Jul 1, 2024Updated 2 years ago
MrZilinXiao / AutoVER
View on GitHub
[ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.
☆14Mar 2, 2024Updated 2 years ago
wuw2019 / LoTLIP
View on GitHub
[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
☆49Jan 14, 2025Updated last year
deepglint / Victor
View on GitHub
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
☆29Aug 15, 2025Updated 11 months ago
jiyounglee-0523 / VisAlign
View on GitHub
☆20Apr 23, 2024Updated 2 years ago
OpenGVLab / V2PE
View on GitHub
[ICCV2025] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆60Apr 4, 2026Updated 3 months ago
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
moukamisama / Recon
View on GitHub
☆12Apr 18, 2023Updated 3 years ago
m1k2zoo / negbench
View on GitHub
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆47Feb 26, 2026Updated 4 months ago
Zjut-MultimediaPlus / PIR-pytorch
View on GitHub
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval (MM'23 Oral)
☆15Dec 8, 2023Updated 2 years ago
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
SSSKJ / HeroLT
View on GitHub
☆11Aug 14, 2024Updated last year
simplelifetime / TIVE
View on GitHub
Less is More: High-value Data Selection for Visual Instruction Tuning
☆20Jan 18, 2025Updated last year
elisakreiss / concadia
View on GitHub
☆16Jan 3, 2023Updated 3 years ago
junhua / EPIC
View on GitHub
EPIC: a large collection of over 30 million epidemic-related tweets
☆12Jul 28, 2020Updated 5 years ago
janghyuncho / DECOLA
View on GitHub
Code release for "Language-conditioned Detection Transformer"
☆86Jun 17, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
Simonlee711 / Clinical_ModernBERT
View on GitHub
[arXiv 2025] Pre-training script for Clinical ModernBERT
☆37Apr 29, 2025Updated last year
bcmi / Awesome-Foreground-Object-Search
View on GitHub
A curated list of papers, code, and resources pertaining to foreground object search.
☆10Feb 24, 2026Updated 5 months ago
facebookresearch / SIEVE
View on GitHub
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)
☆21Apr 28, 2024Updated 2 years ago
mehdie79 / RTM_latent_refinement
View on GitHub
☆22Jul 10, 2026Updated 2 weeks ago
aimagelab / safe-clip
View on GitHub
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
☆67Aug 10, 2024Updated last year
princetonvisualai / icons
View on GitHub
☆22Apr 24, 2025Updated last year