This is an official implementation of GRIT-VLP
☆20Aug 8, 2022Updated 3 years ago
Alternatives and similar repositories for GRIT-VLP
Users that are interested in GRIT-VLP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆56Aug 12, 2024Updated last year
- ☆21Nov 7, 2022Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆22Mar 19, 2022Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"☆46Mar 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022)☆56Mar 1, 2024Updated 2 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 8 months ago
- ☆28Oct 18, 2022Updated 3 years ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 4 years ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆61May 26, 2024Updated last year
- Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"☆12Mar 26, 2026Updated last month
- ☆54Jul 31, 2022Updated 3 years ago
- Google's Conceptual Captions Dataset translated into Korean☆23Aug 28, 2022Updated 3 years ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A general purpose web app for connecting participants to engage in realtime conversations based on generated prompts.☆20Jun 21, 2023Updated 2 years ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆47Dec 1, 2024Updated last year
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 3 years ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆29Feb 27, 2026Updated 2 months ago
- Code and data for ImageCoDe, a contextual vison-and-language benchmark☆41Mar 1, 2024Updated 2 years ago
- Python wrapper around Yossi Rubner's Earth Mover's Distance implementation (http://ai.stanford.edu/~rubner/emd/default.htm)☆22Jul 9, 2015Updated 10 years ago
- A face detection base on faster-rcnn.pytorch☆10Feb 9, 2018Updated 8 years ago
- Find bitcoin network, cluster it, and visualize it.☆10Jul 24, 2015Updated 10 years ago
- ☆33Jul 28, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21May 16, 2023Updated 2 years ago
- Based on StackExchange.Redis that operates Tair For Redis Modules.☆11Feb 28, 2025Updated last year
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Apr 12, 2022Updated 4 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- Paper Today I Read☆29Apr 30, 2026Updated last week
- A paper list of Weakly Supervised Object Detection (WSOD) resources.☆13May 6, 2021Updated 5 years ago
- My implementation of 《Synthesizing Filamentary Structured Images with GANs》☆13Jun 1, 2018Updated 7 years ago
- ☆11Aug 27, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Research code for "Training Vision-Language Transformers from Captions Alone"☆33Jul 15, 2022Updated 3 years ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Oct 24, 2021Updated 4 years ago
- MATE: Masked Autoencoders are Online 3D Test-Time Learners (ICCV 2023)☆23Jul 22, 2023Updated 2 years ago
- ☆22Dec 8, 2021Updated 4 years ago
- Easy and fast deep learning☆21May 30, 2018Updated 7 years ago
- Code repository for MMUGL: Multi-modal Graph Learning over UMLS Knowledge Graphs☆11Dec 7, 2023Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Aug 30, 2021Updated 4 years ago