Weili-NLP/UNIMO

UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning

☆69

Alternatives and similar repositories for UNIMO

Users who are interested in UNIMO are comparing it to the libraries listed below.

XLearning-SCU / 2021-CVPR-MRL
Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)
☆13 · Apr 7, 2021 · Updated 5 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆27 · Aug 20, 2022 · Updated 3 years ago
ChenRocks / UNITER
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
☆800 · Jun 30, 2021 · Updated 5 years ago
PKU-ICST-MIPL / MGAH_TMM2019
Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"
☆12 · Jun 17, 2019 · Updated 7 years ago
facebookresearch / grid-feats-vqa
Grid features pre-training code for visual question answering
☆269 · Sep 17, 2021 · Updated 4 years ago
AlexGreenLab / GARDN-SANDSTORM
Code associated with the publication 'Generative and predictive neural networks for the design of functional RNA molecules'
☆15 · Apr 8, 2025 · Updated last year
ZJULearning / DMP
Code for ACL 2018 paper "Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference".
☆17 · Aug 5, 2018 · Updated 7 years ago
prdwb / okvqa-release
☆15 · May 10, 2021 · Updated 5 years ago
BAAI-WuDao / LegalPLMs
Source code and checkpoints for legal pre-trained language models.
☆14 · May 9, 2021 · Updated 5 years ago
PaddlePaddle / Research
Novel deep learning research work with PaddlePaddle
☆1,757 · Aug 16, 2024 · Updated last year
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18 · Sep 17, 2021 · Updated 4 years ago
zhuchen03 / FreeLB
Adversarial Training for Natural Language Understanding
☆252 · Sep 6, 2023 · Updated 2 years ago
etali / emf
Word Embedding Revisited: Explicit Matrix Factorization
☆33 · Sep 12, 2017 · Updated 8 years ago
Paranioar / SGRAF
[AAAI2021] The code of “Similarity Reasoning and Filtration for Image-Text Matching”
☆220 · Apr 11, 2024 · Updated 2 years ago
yuewang-cuhk / awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
☆1,159 · Aug 19, 2022 · Updated 3 years ago
jiasenlu / vilbert_beta
☆478 · Nov 21, 2022 · Updated 3 years ago
zinengtang / VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
☆56 · Feb 6, 2023 · Updated 3 years ago
malihealikhani / Cross-modal_Coherence_Modeling
Cross-modal Coherence Modeling for Caption Generation
☆11 · Jul 24, 2020 · Updated 5 years ago
qinzzz / Multimodal-Alignment-Framework
Implementation for MAF: Multimodal Alignment Framework
☆46 · Nov 25, 2020 · Updated 5 years ago
windx0303 / VIST-Challenge-NAACL-2018
Official GitHub repo of the VIST Challenge NAACL 2018
☆17 · Aug 3, 2018 · Updated 7 years ago
haofanwang / natural-language-joint-query-search
Search photos on Unsplash based on OpenAI's CLIP model, with support for joint image+text queries and attention visualization.
☆224 · Sep 9, 2021 · Updated 4 years ago
YushanZhu / K3M
Code and Data for paper: Knowledge Perceived Multi-modal Pretraining in E-commerce (ACM MM2021)
☆28 · Oct 10, 2022 · Updated 3 years ago
JDAI-CV / image-captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
☆273 · Jul 27, 2021 · Updated 4 years ago
airsplay / vokenization
PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"
☆191 · Mar 8, 2021 · Updated 5 years ago
benywon / ComQA
Compositional question answering
☆17 · Jun 18, 2021 · Updated 5 years ago
Serega6678 / NuNER
NuNER is a family of SOTA foundation and zero-shot models for entity recognition
☆15 · Jun 11, 2024 · Updated 2 years ago
yangxuntu / SGAE
☆218 · Feb 26, 2022 · Updated 4 years ago
li-xirong / coco-cn
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
☆214 · Feb 12, 2025 · Updated last year
salesforce / ALBEF
Code for ALBEF: a new vision-language pre-training method
☆1,757 · Sep 20, 2022 · Updated 3 years ago
zinengtang / DeCEMBERT
PyTorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17 · Jan 12, 2023 · Updated 3 years ago
BAAI-WuDao / BriVL
Bridging Vision and Language Model
☆286 · Mar 27, 2023 · Updated 3 years ago
jamespark3922 / visual-comet
VisualCOMET: Reasoning about the Dynamic Context of a Still Image
☆87 · Jun 12, 2023 · Updated 3 years ago
nitikam / tangled
Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evalua…
☆15 · Aug 13, 2020 · Updated 5 years ago
bcaitech1 / p4-fr-sorry-math-but-love-you
A math-formula image recognition project that took first place in a competition hosted by NAVER CONNECT boostcamp AI Tech
☆10 · Dec 16, 2023 · Updated 2 years ago
airsplay / lxmert
PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".
☆967 · Oct 22, 2022 · Updated 3 years ago
erobic / negative_analysis_of_grounding
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23 · Jun 26, 2020 · Updated 6 years ago
JindongGu / SimDis
A PyTorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models
☆14 · Jul 15, 2021 · Updated 5 years ago
qingzwang / DiversityMetrics
This is the implementation of self-CIDEr and LSA-based diversity metrics (only for Python 2.7).
☆37 · Feb 26, 2022 · Updated 4 years ago
wuliwei9278 / SQL-Rank
A Novel Listwise Collaborative Ranking Algorithm published at ICML'18
☆17 · Jan 17, 2019 · Updated 7 years ago