[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆26Dec 5, 2023Updated 2 years ago
Alternatives and similar repositories for BLIText
Users that are interested in BLIText are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]☆11May 17, 2024Updated 2 years ago
- ☆18May 13, 2025Updated last year
- [NeurIPS 2023] The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" acce…☆28May 14, 2024Updated 2 years ago
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆13Apr 13, 2026Updated 2 months ago
- ☆22May 11, 2026Updated last month
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆33May 15, 2023Updated 3 years ago
- ACL 2022(findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings☆18Mar 23, 2022Updated 4 years ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieva☆11Nov 12, 2024Updated last year
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- ☆24Oct 9, 2023Updated 2 years ago
- ☆11May 9, 2023Updated 3 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25May 14, 2026Updated last month
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆52Apr 9, 2025Updated last year
- A meta-learning framework for few-shot graph learning☆20Mar 1, 2023Updated 3 years ago
- ☆30May 27, 2023Updated 3 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆38Aug 18, 2024Updated last year
- [ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…☆52Jul 3, 2024Updated last year
- This repo provides a demo for the ICML 2023 paper "Moderately Distributional Exploration for Domain Generalization" on the PACS dataset.☆15Jun 5, 2023Updated 3 years ago
- SimMatchV2: Semi-Supervised Learning with Graph Consistency☆22Dec 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions☆51May 26, 2023Updated 3 years ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated last year
- ☆11Jun 7, 2023Updated 3 years ago
- ☆18Aug 1, 2024Updated last year
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- We are developing a time series forecasting model using reinforcement learning, based on OneNet, for stock market data prediction.☆10Apr 19, 2024Updated 2 years ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆31May 29, 2023Updated 3 years ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 4 years ago
- This is the reading list of Large Language Model-Based Data Science Agent☆40Nov 3, 2025Updated 7 months ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆16May 23, 2025Updated last year
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting"☆18Feb 17, 2025Updated last year
- Awesome autoregressive vision foundation models☆26Dec 24, 2024Updated last year
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆182Oct 1, 2024Updated last year