zengyan-97 / CCLMView external linksLinks
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))
☆92Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for CCLM
Users that are interested in CCLM are comparing it to the libraries listed below
Sorting:
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)☆167Aug 22, 2024Updated last year
- ☆25Mar 4, 2022Updated 3 years ago
- Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua…☆19Jul 5, 2023Updated 2 years ago
- ☆88Jan 10, 2024Updated 2 years ago
- Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning☆21Jun 20, 2024Updated last year
- Multitask Multilingual Multimodal Pre-training☆73Nov 27, 2022Updated 3 years ago
- Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.☆74Oct 25, 2022Updated 3 years ago
- ☆15Apr 30, 2022Updated 3 years ago
- Cross-lingual Visual Pre-training for Multimodal Machine Translation☆18Dec 28, 2021Updated 4 years ago
- Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.☆119Jun 19, 2023Updated 2 years ago
- Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"☆167Jul 6, 2023Updated 2 years ago
- ☆47Jan 14, 2026Updated last month
- ☆18Mar 20, 2022Updated 3 years ago
- OpenAI CLIP text encoders for multiple languages!☆825May 15, 2023Updated 2 years ago
- METER: A Multimodal End-to-end TransformER Framework☆375Nov 16, 2022Updated 3 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆29Jul 14, 2023Updated 2 years ago
- Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".☆21May 25, 2022Updated 3 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- uvgVPCCenc is an open-source real-time V-PCC encoder library written in C++ from scratch.☆17Nov 27, 2025Updated 2 months ago
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Jan 16, 2024Updated 2 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- [EMNLP 2021] Code and data for our paper "Visually Grounded Reasoning across Languages and Cultures"☆30Dec 30, 2021Updated 4 years ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆62Dec 8, 2025Updated 2 months ago
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆32May 12, 2025Updated 9 months ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- This repository contains the code for our ECCV 2022 paper on our "Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning".☆12Dec 6, 2022Updated 3 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 5 months ago
- PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)☆579May 18, 2023Updated 2 years ago
- Code for ALBEF: a new vision-language pre-training method☆1,752Sep 20, 2022Updated 3 years ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆33Jul 16, 2025Updated 6 months ago
- Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…☆74Dec 6, 2023Updated 2 years ago
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆15Dec 25, 2023Updated 2 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆13May 29, 2024Updated last year
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Jul 15, 2021Updated 4 years ago
- Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Sc…☆12May 19, 2023Updated 2 years ago
- ☆28May 16, 2023Updated 2 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Sep 17, 2021Updated 4 years ago