Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", published at ICCV'23.
โ20Jun 3, 2024Updated last year
Alternatives and similar repositories for CLIPTrans
Users that are interested in CLIPTrans are comparing it to the libraries listed below
Sorting:
- ViT models pretrained with up to ~5k hours of human-like video dataโ14Aug 10, 2023Updated 2 years ago
- ๐ ๐ Auto check for new apartments in Hamburg from various real estate providesโ16Jun 2, 2024Updated last year
- A library for data streaming and augmentationโ21May 5, 2025Updated 9 months ago
- [ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".โ12Oct 11, 2024Updated last year
- โ19Apr 28, 2023Updated 2 years ago
- Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..โ20Dec 3, 2023Updated 2 years ago
- โ20May 3, 2025Updated 9 months ago
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.โ19Jun 7, 2024Updated last year
- Open Vocabulary Learning for Neural Chinese Pinyin IME (ACL 2020)โ20Jun 12, 2019Updated 6 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)โ25May 16, 2024Updated last year
- [CVPR 2024] Joint-Task Regularization for Partially Labeled Multi-Task Learningโ24May 31, 2024Updated last year
- Serverless Optimized MODules - A Serverless Framework to create reusable micro appsโ18Jul 7, 2025Updated 7 months ago
- Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'โ28Dec 3, 2023Updated 2 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrievalโ26Jun 28, 2021Updated 4 years ago
- VisualGPTScore for visio-linguistic reasoningโ27Oct 7, 2023Updated 2 years ago
- โ36May 24, 2024Updated last year
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformersโ34Dec 30, 2024Updated last year
- โ10Feb 10, 2026Updated 2 weeks ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribuโฆโ10Jun 14, 2018Updated 7 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022โ11Apr 13, 2025Updated 10 months ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selectorโ37Mar 7, 2024Updated last year
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"โ41Aug 9, 2022Updated 3 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)โ45Nov 29, 2023Updated 2 years ago
- Grab some/all of CodeQL CLI binary, QL library, VSCode starter workspace, VSCode and VSCode QL extensionโ11Jun 12, 2025Updated 8 months ago
- A framework for steering MoE models by detecting and controlling behavior-linked experts.โ29Sep 12, 2025Updated 5 months ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?โ11Jan 3, 2019Updated 7 years ago
- The sparse Bayesian learning sandboxโ11Jul 4, 2021Updated 4 years ago
- Dataset for bounding box labels and terrain meshes of the POLAR databaseโ10Jul 10, 2025Updated 7 months ago
- โ17Updated this week
- โ10Updated this week
- ่็5.1ๅฎคๅ ๅฎไฝโ12Jun 8, 2022Updated 3 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Modelโ13Feb 15, 2024Updated 2 years ago
- Data Programming for Text Detection in Documents using SPEARโ12Mar 26, 2025Updated 11 months ago
- โ10Nov 15, 2023Updated 2 years ago
- โ10Apr 7, 2024Updated last year
- โ41Mar 27, 2024Updated last year
- Partially Non-Autoregressive Image Captioningโ10Sep 30, 2021Updated 4 years ago
- โ12Jun 26, 2024Updated last year
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Converโฆโ10Jul 21, 2023Updated 2 years ago