[CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
☆17May 14, 2024Updated last year
Alternatives and similar repositories for SelTDA
Users that are interested in SelTDA are comparing it to the libraries listed below
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- [CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering☆20Sep 21, 2024Updated last year
- ☆24Oct 9, 2023Updated 2 years ago
- ☆29Oct 18, 2022Updated 3 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Aug 22, 2023Updated 2 years ago
- [NeurIPS 2023] Generalized Logit Adjustment☆40Apr 21, 2024Updated last year
- Counterfactual Reasoning VQA Dataset☆28Nov 23, 2023Updated 2 years ago
- [CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering☆54Jul 14, 2025Updated 8 months ago
- ☆12Dec 20, 2024Updated last year
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆39Mar 27, 2025Updated 11 months ago
- A Modern Configuration/Registry System designed for deeplearning, with some utils.☆18Dec 23, 2025Updated 2 months ago
- [NeurIPS 2024] Official PyTorch implementation of "Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives"☆46Dec 1, 2024Updated last year
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21☆13Jul 15, 2022Updated 3 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- ☆12Mar 8, 2021Updated 5 years ago
- ☆12Jan 10, 2025Updated last year
- ☆12Nov 30, 2022Updated 3 years ago
- MICCAI 2022: Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions☆12Sep 17, 2022Updated 3 years ago
- Data and code for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- REDSearch: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, …☆62Feb 26, 2026Updated 3 weeks ago
- Prebuilt TA-Lib wheels☆16Jun 30, 2025Updated 8 months ago
- nocaps: novel object captioning at scale☆10May 23, 2019Updated 6 years ago
- Example of rendering PDF files with vue3+vite+pdfjs; demonstrates three rendering modes: canvas rendering, HTML rendering, and full-example rendering☆11Feb 14, 2024Updated 2 years ago
- Segment graph convolutional neural network for relation classification. Paper in JAMIA.☆10May 13, 2019Updated 6 years ago
- ☆10May 4, 2018Updated 7 years ago
- Resources related to the model cards for ML☆11Mar 16, 2021Updated 5 years ago
- Official source code of ICDM2023 paper "Hypergraph Contrastive Learning for Drug Trafficking Community Detection".☆11Nov 3, 2023Updated 2 years ago
- ☆23Nov 4, 2024Updated last year
- [IEEE ICCBD+AI 2025] Spiking-Diffusion: Vector Quantized Discrete Diffusion Model with Spiking Neural Networks☆20Dec 24, 2023Updated 2 years ago
- The goal of this project is to build classification and regression decision trees without using any machine learning libraries☆10Dec 28, 2018Updated 7 years ago
- [NeurIPS 2021] Introspective Distillation for Robust Question Answering☆13Dec 7, 2021Updated 4 years ago
- Bayesian Optimization Meets Self-Distillation, ICCV 2023☆10Aug 28, 2023Updated 2 years ago
- A collection of papers related to Geo-spatial Information Science in CVPR 2025.☆39Apr 1, 2025Updated 11 months ago
- ☆16Feb 19, 2025Updated last year
- [ICML 2024] Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models"☆23Jan 1, 2025Updated last year
- PyTorch implementation of the computer vision related part of the paper "Unsupervised Data Augmentation for Consistency Training".☆10Mar 26, 2020Updated 5 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- Action recognition based on action graph, which describes the spatio-temporal relationship between dense trajectory clusters. The program…☆11Jan 7, 2015Updated 11 years ago