facebookresearch/clip-rocket

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/clip-rocket)

facebookresearch / clip-rocket

Code release for "Improved baselines for vision-language pre-training"

☆63

Alternatives and similar repositories for clip-rocket

Users that are interested in clip-rocket are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / EgocentricUserAdaptation
View on GitHub
In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…
☆15Updated this week
facebookresearch / FFCV-SSL
View on GitHub
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆217Aug 1, 2023Updated 2 years ago
facebookresearch / tce
View on GitHub
Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.
☆14Sep 15, 2023Updated 2 years ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
facebookresearch / VLaMP
View on GitHub
Code for “Pretrained Language Models as Visual Planners for Human Assistance”
☆64Jun 12, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
DonkeyShot21 / essential-BYOL
View on GitHub
An essential implementation of BYOL in PyTorch + PyTorch Lightning
☆51Jul 15, 2021Updated 5 years ago
facebookresearch / iclmlp
View on GitHub
Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"
☆18May 29, 2023Updated 3 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
yuxiaochen1103 / FDT
View on GitHub
☆60Jun 16, 2023Updated 3 years ago
marco-garosi / CIRCLE
View on GitHub
[CVPR Findings 2026] Large Multimodal Models as General In-Context Classifiers
☆24Mar 1, 2026Updated 4 months ago
SivanDoveh / DAC
View on GitHub
Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models
☆28Nov 29, 2023Updated 2 years ago
jimmyxu123 / SELECT
View on GitHub
This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"
☆16Oct 8, 2024Updated last year
raytrun / mamba-clip
View on GitHub
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
☆83Aug 15, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
linzhiqiu / visual_gpt_score
View on GitHub
VisualGPTScore for visio-linguistic reasoning
☆27Oct 7, 2023Updated 2 years ago
myokit / model-list
View on GitHub
A list of (detailed, non-stochastic) action potential models, with links to papers, source code, CellML and Myokit implementations
☆12Updated this week
princetonvisualai / pointingqa
View on GitHub
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
valentineashio / Online-Payments-Fraud-Detection-Dataset-Case-Study
View on GitHub
A Data Science/Machine Learning Project. According to Bolster , Global Fraud Index (as at June 2022) is at 10,183 and growing. This is h…
☆15Jul 25, 2022Updated 4 years ago
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
MahmudulAlam / Holographic-Reduced-Representations
View on GitHub
Holographic Reduced Representations
☆37Dec 4, 2025Updated 7 months ago
zipengxuc / StylerDALLE
View on GitHub
Code for ICCV 2023 paper ✨ "StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Mo…
☆18Jan 25, 2024Updated 2 years ago
FutureComputing4AI / Learning-with-Holographic-Reduced-Representations
View on GitHub
☆38Oct 13, 2021Updated 4 years ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
yaox12 / BYOL-PyTorch
View on GitHub
PyTorch implementation of "Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning" with DDP and Apex AMP
☆83Sep 16, 2020Updated 5 years ago
ygjwd12345 / VISTA-Net
View on GitHub
The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning"
☆14Nov 28, 2022Updated 3 years ago
ajtejankar / byol-convert
View on GitHub
Code and model definition for converting the official BYOL weights to PyTorch
☆33Nov 23, 2021Updated 4 years ago
elisakreiss / concadia
View on GitHub
☆16Jan 3, 2023Updated 3 years ago
AlonMendelson / SGVL
View on GitHub
☆17Dec 13, 2023Updated 2 years ago
yonatanbitton / wysiwyr
View on GitHub
☆37Oct 7, 2023Updated 2 years ago
megvii-research / protoclip
View on GitHub
📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)
☆56Nov 8, 2023Updated 2 years ago
wq2012 / VB_diarization
View on GitHub
VB Diarization with Eigenvoice and HMM Priors, refactored
☆14Jul 27, 2021Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
dual-view-molecule-pretraining / dmp
View on GitHub
☆11Jun 4, 2021Updated 5 years ago
facebookresearch / SIEVE
View on GitHub
SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)
☆21Apr 28, 2024Updated 2 years ago
pietroastolfi / suave-daino
View on GitHub
Code release for "Semi-supervised learning made simple with self-supervised clustering"
☆63Jun 13, 2023Updated 3 years ago
mittalgovind / GOTCHA-Deepfakes
View on GitHub
Official Repository for "GOTCHA: Real-Time Video Deepfake Detection via Challenge-Response"
☆11Jul 8, 2024Updated 2 years ago
azu20 / real-estate
View on GitHub
This MERN stack real estate application, uses Auth0 for login, it has the search functionality to look for properties listed for sale, an…
☆10Jan 6, 2021Updated 5 years ago
linusericsson / ssl-invariances
View on GitHub
Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".
☆16Dec 7, 2021Updated 4 years ago
EvolvingLMMs-Lab / MGPO
View on GitHub
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
☆55Jul 23, 2025Updated last year