☆25 Jul 10, 2023 Updated 2 years ago
Alternatives and similar repositories for semantic-image-text-alignment
Users that are interested in semantic-image-text-alignment are comparing it to the libraries listed below
- SotA text-only image/video method (IJCAI 2023) ☆16 Jan 9, 2024 Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 ☆11 Apr 13, 2025 Updated 10 months ago
- Paper list of compositional zero-shot learning ☆11 Jul 5, 2022 Updated 3 years ago
- PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?" ☆11 Sep 26, 2021 Updated 4 years ago
- ☆12 Jul 16, 2024 Updated last year
- ☆11 Sep 7, 2020 Updated 5 years ago
- Code for paper: "Privately generating tabular data using language models". ☆15 Jun 13, 2023 Updated 2 years ago
- Retrieval-augmented Image Captioning ☆13 Feb 16, 2023 Updated 3 years ago
- Aligntune: A Modular Toolkit for Post-Training Alignment of LLMs ☆35 Feb 26, 2026 Updated last week
- ☆14 Aug 19, 2024 Updated last year
- ☆14 Jul 30, 2022 Updated 3 years ago
- Data release for Step Differences in Instructional Video (CVPR24) ☆14 Jun 19, 2024 Updated last year
- This is a repository dedicated to pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment. ☆23 Nov 14, 2024 Updated last year
- Implementation of the "Learn No to Say Yes Better" paper. ☆39 Oct 30, 2025 Updated 4 months ago
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align… ☆19 Apr 5, 2024 Updated last year
- ☆19 Nov 7, 2022 Updated 3 years ago
- Dual-Branch Network for Portrait Image Quality Assessment ☆18 Sep 16, 2025 Updated 5 months ago
- ☆20 May 3, 2025 Updated 10 months ago
- [CVPR(W) 2022] UIGR: Unified Interactive Garment Retrieval ☆22 Dec 3, 2021 Updated 4 years ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity" ☆61 Jun 12, 2023 Updated 2 years ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval ☆55 Nov 26, 2024 Updated last year
- ☆58 Aug 7, 2023 Updated 2 years ago
- An introduction to conformal prediction ☆27 Jan 31, 2024 Updated 2 years ago
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie… ☆31 Jun 14, 2023 Updated 2 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 Jun 28, 2022 Updated 3 years ago
- [CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation ☆65 Jul 29, 2025 Updated 7 months ago
- ☆125 Feb 21, 2023 Updated 3 years ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching" ☆37 Aug 18, 2024 Updated last year
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral) ☆34 Mar 24, 2025 Updated 11 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024) ☆35 Sep 9, 2024 Updated last year
- Generating Image Specific Text ☆29 Aug 14, 2023 Updated 2 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆34 Jan 23, 2024 Updated 2 years ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval" ☆84 Jul 4, 2024 Updated last year
- ☆33 Mar 1, 2023 Updated 3 years ago
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers ☆34 Dec 30, 2024 Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally? ☆35 Apr 27, 2023 Updated 2 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain. ☆37 Mar 13, 2023 Updated 2 years ago
- [CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning ☆208 Sep 30, 2022 Updated 3 years ago
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning ☆36 Aug 8, 2024 Updated last year