Code release for "Improved baselines for vision-language pre-training"
☆62 · May 6, 2024 · Updated 2 years ago
Alternatives and similar repositories for clip-rocket
Users who are interested in clip-rocket are comparing it to the libraries listed below.
- Fréchet Joint Distance ☆15 · Nov 27, 2019 · Updated 6 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d. First, we start from a population model which ha… ☆15 · May 18, 2026 · Updated last week
- Official code repository for Instance Selection for GANs. ☆44 · Jan 22, 2021 · Updated 5 years ago
- Project for SNARE benchmark ☆11 · Jun 5, 2024 · Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ☆13 · Sep 30, 2023 · Updated 2 years ago
- Removing Cost Volumes from Optical Flow Estimators (ICCV 2025 Oral) ☆37 · Dec 2, 2025 · Updated 5 months ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance” ☆62 · Jun 12, 2023 · Updated 2 years ago
- An essential implementation of BYOL in PyTorch + PyTorch Lightning ☆51 · Jul 15, 2021 · Updated 4 years ago
- Generative model for 3D high-resolution cardiac segmentation ☆13 · Feb 25, 2022 · Updated 4 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 3 years ago
- Needles in Haystacks: On Classifying Tiny Objects in Large Images ☆23 · Jun 28, 2019 · Updated 6 years ago
- ☆10 · Jul 5, 2024 · Updated last year
- ☆61 · Jun 16, 2023 · Updated 2 years ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?" ☆33 · Mar 15, 2024 · Updated 2 years ago
- This is the code for the paper "On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework, IC… ☆12 · Jul 6, 2021 · Updated 4 years ago
- ☆29 · Jul 23, 2025 · Updated 10 months ago
- ☆20 · Apr 23, 2024 · Updated 2 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC… ☆18 · Feb 12, 2025 · Updated last year
- [WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining ☆13 · Jan 3, 2024 · Updated 2 years ago
- Code for ICCV 2023 paper ✨ "StylerDALLE: Language-Guided Style Transfer Using a Vector-Quantized Tokenizer of a Large-Scale Generative Mo… ☆18 · Jan 25, 2024 · Updated 2 years ago
- The official PyTorch code for "Continual Attentive Fusion for Incremental Learning in Semantic Segmentation" ☆16 · Apr 8, 2022 · Updated 4 years ago
- This is the public repo for the course HMMA238 'Software Development' ☆11 · Apr 20, 2021 · Updated 5 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions ☆17 · Apr 4, 2024 · Updated 2 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study ☆16 · Nov 22, 2024 · Updated last year
- PyTorch implementation of "Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning" with DDP and Apex AMP ☆84 · Sep 16, 2020 · Updated 5 years ago
- The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning" ☆14 · Nov 28, 2022 · Updated 3 years ago
- ☆17 · Dec 13, 2023 · Updated 2 years ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models" ☆20 · Oct 17, 2024 · Updated last year
- 📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023) ☆55 · Nov 8, 2023 · Updated 2 years ago
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024) ☆19 · Apr 28, 2024 · Updated 2 years ago
- This is the repository for TimelineQA, a benchmark for querying lifelogs. ☆26 · Jul 5, 2023 · Updated 2 years ago
- A list of (detailed, non-stochastic) action potential models, with links to papers, source code, CellML and Myokit implementations ☆12 · May 11, 2026 · Updated 2 weeks ago
- PyTorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆41 · Nov 27, 2022 · Updated 3 years ago
- ☆37 · Oct 7, 2023 · Updated 2 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer? ☆11 · Jan 3, 2019 · Updated 7 years ago
- ☆19 · Oct 16, 2023 · Updated 2 years ago
- EMMA [TMLR 2025] ☆13 · Sep 25, 2025 · Updated 8 months ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks". ☆16 · Dec 7, 2021 · Updated 4 years ago