[CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
☆17May 14, 2024Updated last year
Alternatives and similar repositories for SelTDA
Users that are interested in SelTDA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using image captions with LLM for zero-shot VQA☆19Mar 14, 2024Updated 2 years ago
- ☆18May 31, 2023Updated 2 years ago
- ☆13Feb 14, 2022Updated 4 years ago
- 武汉大学遥感院摄影测量学作业 Homework of Course Photogrammetry in Wuhan University: Space-Resection(Photogrammetry) 空间后方交会☆11Jan 11, 2022Updated 4 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆20Sep 21, 2024Updated last year
- This is the repository for assignments of EECS598: Deep Learning for Computer Vision by professor Justin Johnson at the University of Mic…☆12Jul 27, 2022Updated 3 years ago
- Yet another LLM☆10Apr 6, 2023Updated 3 years ago
- ☆24Oct 9, 2023Updated 2 years ago
- ☆28Oct 18, 2022Updated 3 years ago
- ☆36Mar 31, 2026Updated last week
- 🦀 A tool for dump Tauri assets☆21Jan 6, 2026Updated 3 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Aug 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2023] Generalized Logit Adjustment☆40Apr 21, 2024Updated last year
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- The good practice in the VQA system such as pos-tag attention, structed triplet learning and triplet attention is very general and can be…☆19Jan 23, 2018Updated 8 years ago
- ☆12Jan 16, 2020Updated 6 years ago
- ☆11Feb 12, 2018Updated 8 years ago
- ☆12Dec 20, 2024Updated last year
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆54Jul 14, 2025Updated 8 months ago
- 🐞 An extension for nvim-dap to provide C, C++, and Rust debugging support.☆42Mar 22, 2026Updated 2 weeks ago
- 数学建模常用算法,参考《MATLAB在数学建模中的应用·第二版》☆22Nov 16, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Vision-Language Dataset for Remote Sensing☆42May 27, 2025Updated 10 months ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆40Mar 27, 2025Updated last year
- A Modern Configuration/Registry System designed for deeplearning, with some utils.☆18Dec 23, 2025Updated 3 months ago
- Python Vascular Network Analysis☆27Feb 25, 2026Updated last month
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21☆13Jul 15, 2022Updated 3 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- dotfiles, batteries included☆21Jan 29, 2026Updated 2 months ago
- ☆12Mar 8, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Jan 10, 2025Updated last year
- 武大遥感2020级网络GIS课程设计☆24Dec 14, 2023Updated 2 years ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 3 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- Generative Bias for Robust Visual Question Answering ( CVPR 2023 )☆28Jul 4, 2023Updated 2 years ago