ChiYeungLaw / LexLIP-ICCV23View external linksLinks
Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval"
☆40Oct 14, 2023Updated 2 years ago
Alternatives and similar repositories for LexLIP-ICCV23
Users that are interested in LexLIP-ICCV23 are comparing it to the libraries listed below
Sorting:
- Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training☆30Jun 20, 2023Updated 2 years ago
- Code for the paper "DMCL: Distillation Multiple Choice Learning for Multimodal Action Recognition"☆15Jan 17, 2020Updated 6 years ago
- [CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…☆44Oct 29, 2025Updated 3 months ago
- ☆15Apr 30, 2022Updated 3 years ago
- This repo contains the official implementation of HAPPIER: Hierarchical Average Precision Training for Pertinent Image Retrieval (ECCV'22…☆23Apr 6, 2023Updated 2 years ago
- ☆53Sep 13, 2023Updated 2 years ago
- [SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".☆23Apr 29, 2022Updated 3 years ago
- ☆30Mar 13, 2024Updated last year
- code for EACL2024-main:Generative Dense Retrieval: Memory Can Be a Burden☆32Jan 19, 2024Updated 2 years ago
- A Python interface to PISA☆37Sep 23, 2025Updated 4 months ago
- [TIP2023] The code of “Plug-and-Play Regulators for Image-Text Matching”☆34Apr 11, 2024Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆83Jul 4, 2024Updated last year
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》☆31Mar 12, 2024Updated last year
- [ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives☆39Sep 9, 2025Updated 5 months ago
- Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences".☆33May 23, 2023Updated 2 years ago
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆62Oct 9, 2025Updated 4 months ago
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆46Sep 8, 2025Updated 5 months ago
- Reproduce KGAT using DGL☆35Dec 17, 2019Updated 6 years ago
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 2 years ago
- A fast heuristic search algorithm for finding the longest common subsequence of multiple strings☆10Nov 22, 2023Updated 2 years ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆52Jul 3, 2024Updated last year
- All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)☆167Aug 22, 2024Updated last year
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Generative label fused network for image–text matching☆10Jan 13, 2023Updated 3 years ago
- Eagle and EagleSim: Deep-RL for PTZ Cameras☆10Aug 23, 2024Updated last year
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- ☆11Aug 1, 2024Updated last year
- ☆20Nov 21, 2025Updated 2 months ago
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆11Jun 12, 2023Updated 2 years ago
- This is the Capstone Project of Udacity Machine Learning Nanodegree.☆12Jul 31, 2017Updated 8 years ago
- THE VISUAL COMPUTER “High-level LoRA and hierarchical fusion for enhanced micro-expression recognition”☆13Oct 12, 2024Updated last year
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- [🔥ACM MM2025] EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation☆23Dec 30, 2025Updated last month
- dMel: Speech Tokenization Made Simple☆16May 13, 2025Updated 9 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- A simply deep learning based blur image detector.☆10Mar 29, 2023Updated 2 years ago
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 3 years ago