zengyan-97/CCLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zengyan-97/CCLM)

zengyan-97 / CCLM

Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))

☆93

Alternatives and similar repositories for CCLM

Users that are interested in CCLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zmykevin / UC2
View on GitHub
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Nov 9, 2021Updated 4 years ago
zengyan-97 / X-VLM
View on GitHub
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
☆507Nov 25, 2022Updated 3 years ago
adapter-hub / xGQA
View on GitHub
☆25Mar 4, 2022Updated 4 years ago
zengyan-97 / X2-VLM
View on GitHub
All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)
☆169Aug 22, 2024Updated last year
LiJiaBei-7 / nrccr
View on GitHub
Source code of our MM'22 paper Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning
☆21Jun 20, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
microsoft / M3P
View on GitHub
Multitask Multilingual Multimodal Pre-training
☆72Nov 27, 2022Updated 3 years ago
gregor-ge / mBLIP
View on GitHub
☆88Jan 10, 2024Updated 2 years ago
FudanDISC / weakly-supervised-mVLP
View on GitHub
Implementation of our ACL2023 paper: Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Langua…
☆19Jul 5, 2023Updated 3 years ago
kywen1119 / DSRAN
View on GitHub
Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.
☆74Oct 25, 2022Updated 3 years ago
rucmlcv / Wenlan-Video-Public
View on GitHub
☆18Mar 20, 2022Updated 4 years ago
ImperialNLP / VTLM
View on GitHub
Cross-lingual Visual Pre-training for Multimodal Machine Translation
☆18Dec 28, 2021Updated 4 years ago
uds-lsv / MCSE
View on GitHub
NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings
☆58Jun 10, 2024Updated 2 years ago
CrossmodalGroup / NAAF
View on GitHub
Implementation of our CVPR2022 paper, Negative-Aware Attention Framework for Image-Text Matching.
☆119Jun 19, 2023Updated 3 years ago
FreddeFrallan / Multilingual-CLIP
View on GitHub
OpenAI CLIP text encoders for multiple languages!
☆832May 15, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
WangFei-2019 / Image-text-Retrieval
View on GitHub
☆47Jan 14, 2026Updated 6 months ago
microsoft / BridgeTower
View on GitHub
Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"
☆168Jul 6, 2023Updated 3 years ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
GeorgeVern / smala
View on GitHub
Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".
☆13Sep 17, 2021Updated 4 years ago
yiren-jian / NonLing-CSE
View on GitHub
[NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
☆22Jan 30, 2023Updated 3 years ago
google-research-datasets / maxm
View on GitHub
MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…
☆13Jan 16, 2024Updated 2 years ago
scofield7419 / UMMT-VSH
View on GitHub
Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Sc…
☆12May 19, 2023Updated 3 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
ZhangXu0963 / VSL
View on GitHub
The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.
☆15Dec 25, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
salesforce / ALBEF
View on GitHub
Code for ALBEF: a new vision-language pre-training method
☆1,755Sep 20, 2022Updated 3 years ago
kuanghuei / SCAN
View on GitHub
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
☆579May 18, 2023Updated 3 years ago
ImperialNLP / BertGen
View on GitHub
Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)
☆11Sep 17, 2023Updated 2 years ago
GeneZC / MiniMoE
View on GitHub
Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
☆29Jul 14, 2023Updated 3 years ago
zdou0830 / METER
View on GitHub
METER: A Multimodal End-to-end TransformER Framework
☆377Nov 16, 2022Updated 3 years ago
PKU-ICST-MIPL / MKVSE-TOMM2023
View on GitHub
☆28May 16, 2023Updated 3 years ago
mesnico / TERAN
View on GitHub
Code and Resources for the Transformer Encoder Reasoning and Alignment Network (TERAN), accepted for publication in ACM Transactions on M…
☆74Dec 6, 2023Updated 2 years ago
libeineu / fairseq_mmt
View on GitHub
This code repository is for the accepted ACL2022 paper "On Vision Features in Multimodal Machine Translation". We provide the details and…
☆43Sep 16, 2022Updated 3 years ago
LuminosityX / FNE
View on GitHub
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..
☆20Dec 3, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JerryYLi / valhalla-nmt
View on GitHub
Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"
☆28Feb 19, 2023Updated 3 years ago
quangvnai / grit
View on GitHub
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
☆199May 9, 2023Updated 3 years ago
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
Vision-CAIR / artemis-v2
View on GitHub
Code for the paper: It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection
☆30Nov 27, 2022Updated 3 years ago
mad-red / VSR-guided-CIC
View on GitHub
Human-like Controllable Image Captioning with Verb-specific Semantic Roles.
☆36Mar 11, 2022Updated 4 years ago
shizhediao / T-DNA
View on GitHub
Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…
☆19Jan 12, 2023Updated 3 years ago
uta-smile / TCL
View on GitHub
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
☆270Oct 2, 2024Updated last year