Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", published at ICCV'23.
☆20 · Jun 3, 2024 · Updated last year
Alternatives and similar repositories for CLIPTrans
Users who are interested in CLIPTrans are comparing it to the repositories listed below
- ViT models pretrained with up to ~5k hours of human-like video data ☆14 · Aug 10, 2023 · Updated 2 years ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training ☆34 · Nov 9, 2021 · Updated 4 years ago
- A library for data streaming and augmentation ☆21 · May 5, 2025 · Updated 10 months ago
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination. ☆20 · Dec 3, 2023 · Updated 2 years ago
- ☆12 · Mar 7, 2022 · Updated 4 years ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts. ☆30 · Sep 12, 2025 · Updated 6 months ago
- ☆10 · Apr 7, 2024 · Updated last year
- This repository contains the code for the IEEE Robotics and Automation Letters paper "Open-Set Object Detection Using Classification-Free… ☆14 · Dec 6, 2023 · Updated 2 years ago
- A model for unsupervised morphological analysis that integrates orthographic and semantic views of words. ☆13 · Oct 10, 2023 · Updated 2 years ago
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.' ☆28 · Dec 3, 2023 · Updated 2 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment ☆22 · Apr 15, 2022 · Updated 3 years ago
- Code & data accompanying the paper "Unveiling Implicit Deceptive Patterns in Multi-modal Fake News via Neuro-Symbolic Reasoning". ☆13 · Dec 21, 2023 · Updated 2 years ago
- ☆14 · May 7, 2024 · Updated last year
- ☆17 · May 31, 2023 · Updated 2 years ago
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning ☆15 · Apr 29, 2024 · Updated last year
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023. ☆15 · Dec 25, 2023 · Updated 2 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. ☆19 · Dec 27, 2024 · Updated last year
- ☆20 · Dec 16, 2024 · Updated last year
- ☆12 · May 3, 2024 · Updated last year
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition ☆16 · Jul 9, 2024 · Updated last year
- Open Vocabulary Learning for Neural Chinese Pinyin IME (ACL 2020) ☆20 · Jun 12, 2019 · Updated 6 years ago
- Official code Implementation of "Text and Image Are Mutually Beneficial: Enhancing Training-Free Few-Shot Classification with CLIP" (AAA… ☆21 · Dec 17, 2024 · Updated last year
- Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER ☆22 · Jul 19, 2023 · Updated 2 years ago
- ☆17 · Oct 10, 2023 · Updated 2 years ago
- Official repository for the ICCV2023 paper SAFE: Sensitivity-Aware Features for Out-of-Distribution Object Detection ☆13 · Jul 28, 2024 · Updated last year
- Code for the Globetrotter project ☆23 · Mar 17, 2022 · Updated 4 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral) ☆25 · May 16, 2024 · Updated last year
- Nearest Neighbor Normalization (EMNLP 2024) ☆20 · Nov 1, 2024 · Updated last year
- [IEEE TPAMI 2025] REST: Holistic Learning for End-to-End Semantic Segmentation of Whole-Scene Remote Sensing Imagery ☆36 · Updated this week
- How a beginner can learn quantitative trading in a few hours ☆11 · Jan 5, 2021 · Updated 5 years ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation ☆12 · Jan 6, 2023 · Updated 3 years ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023) ☆61 · Apr 8, 2024 · Updated last year
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval ☆26 · Jun 28, 2021 · Updated 4 years ago
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua… ☆19 · Jul 5, 2023 · Updated 2 years ago
- Official implementation for "Out-of-Distribution Detection in Long-Tailed Recognition with Calibrated Outlier Class Learning" (AAAI'24) ☆19 · Jun 12, 2024 · Updated last year
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 · Mar 8, 2022 · Updated 4 years ago
- [ICCV 2023 Oral] Official repository for “On the Robustness of Open-World Test-Time Training: Self-Training with Dynamic Prototype Expans… ☆47 · Dec 18, 2024 · Updated last year
- Implementation of "Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning" WACV 2023. ☆26 · Sep 6, 2023 · Updated 2 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer? ☆11 · Jan 3, 2019 · Updated 7 years ago