MichaelZhouwang/VLUE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MichaelZhouwang/VLUE)

MichaelZhouwang / VLUE

This repo contains codes and instructions for baselines in the VLUE benchmark.

☆41

Alternatives and similar repositories for VLUE

Users that are interested in VLUE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

e-bug / fine-grained-evals
View on GitHub
[ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"
☆13Jun 11, 2023Updated 3 years ago
zengyan-97 / X-VLM
View on GitHub
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
☆507Nov 25, 2022Updated 3 years ago
shizhediao / DaVinci
View on GitHub
Source code for the paper "Prefix Language Models are Unified Modal Learners"
☆45Apr 30, 2023Updated 3 years ago
zengyan-97 / X2-VLM
View on GitHub
All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)
☆169Aug 22, 2024Updated last year
zmykevin / UC2
View on GitHub
CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training
☆34Nov 9, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
uta-smile / TCL
View on GitHub
code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022
☆271Oct 2, 2024Updated last year
MichaelZhouwang / Sequence_Span_Rewriting
View on GitHub
Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
☆17Nov 30, 2021Updated 4 years ago
EastTower16 / LLMDataDistill
View on GitHub
distill large scale web page text
☆12Jul 29, 2023Updated 2 years ago
swaggy-TN / EfficientVLM
View on GitHub
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)
☆33Jul 18, 2023Updated 3 years ago
e-bug / iglue
View on GitHub
[ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"
☆49Dec 7, 2022Updated 3 years ago
gmftbyGMFTBY / MomentumDecoding
View on GitHub
Momentum Decoding: Open-ended Text Generation as Graph Exploration
☆19Jan 27, 2023Updated 3 years ago
jokieleung / Maria
View on GitHub
PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".
☆23Sep 19, 2021Updated 4 years ago
jeykigung / HiCLIP
View on GitHub
☆31Mar 2, 2023Updated 3 years ago
yiren-jian / NonLing-CSE
View on GitHub
[NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings
☆22Jan 30, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MGheini / xattn-transfer-for-mt
View on GitHub
Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…
☆33Sep 15, 2021Updated 4 years ago
zengyan-97 / MultiT-C-Dialog
View on GitHub
A multi-task learning approach for conditioned response generation (NAACL 2021)
☆12Nov 18, 2022Updated 3 years ago
guxu313 / TeViS
View on GitHub
☆21Aug 26, 2025Updated 10 months ago
Aman-4-Real / OpenDomainDialogCorpus
View on GitHub
Open domain Chinese dialogue corpus and datasets.
☆17Jan 8, 2022Updated 4 years ago
causalNLP / AI-Scholar
View on GitHub
☆23Dec 8, 2022Updated 3 years ago
Theo-Jaunet / VisQA
View on GitHub
Online visual analytics tool designed to investigate how attention maps in transformer models behaves, and build hypothesis on those mode…
☆10Nov 10, 2021Updated 4 years ago
Aman-4-Real / MMTG
View on GitHub
[ACM MM 2022] (Oral): Multi-Modal Experience Inspired AI Creation
☆21Nov 27, 2024Updated last year
Aman-4-Real / CrEval
View on GitHub
[ICLR 2026] Evaluating Text Creativity across Diverse Domains: A Dataset and a Large Language Model Evaluator
☆18Feb 28, 2026Updated 4 months ago
chuhaojin / WenLan-api-document
View on GitHub
The Document of WenLan API, which was used to obtain image and text feature.
☆41Jan 10, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RWKV-Wiki / MultilingualShareGPT
View on GitHub
MultilingualShareGPT, the free multi-language corpus for LLM training
☆72Apr 6, 2023Updated 3 years ago
Kilichbek / artemis-speaker-tools-b
View on GitHub
Artemis Speaker Tools B
☆24Apr 4, 2021Updated 5 years ago
Aman-4-Real / awesome-multimodal-dialogue
View on GitHub
Paper, dataset and code list for multimodal dialogue.
☆22Jan 2, 2025Updated last year
TsinghuaAI / CPM-2-Pretrain
View on GitHub
Code for CPM-2 Pre-Train
☆157Mar 18, 2023Updated 3 years ago
AlbertTan404 / RoLD
View on GitHub
[MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling
☆24Aug 4, 2024Updated last year
Yuhan-Shen / VisualNarrationProceL-CVPR21
View on GitHub
☆15May 23, 2023Updated 3 years ago
ThomasScialom / T0_continual_learning
View on GitHub
Adding new tasks to T0 without catastrophic forgetting
☆33Oct 20, 2022Updated 3 years ago
lfovia / QAGANS
View on GitHub
Quality Aware Generative Adversarial Networks
☆20Apr 15, 2020Updated 6 years ago
jmhessel / pycocoevalcap
View on GitHub
Python 3 support for the MS COCO caption evaluation tools
☆14Jun 14, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zengyan-97 / CCLM
View on GitHub
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))
☆93Jun 12, 2023Updated 3 years ago
coldmanck / RVL-BERT
View on GitHub
The official code for "Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations" (IEEE Access, 2021…
☆18Oct 21, 2022Updated 3 years ago
sinovation / ZEN2
View on GitHub
The enhanced version of ZEN, larger and more powerful.
☆31Jul 22, 2022Updated 4 years ago
idstcv / InMaP
View on GitHub
PyTorch Implementation for InMaP
☆12Oct 28, 2023Updated 2 years ago
SALT-NLP / Impressions
View on GitHub
Dataset for the investigation of visual semiotics, and how specific visual features and design choices can elicit specific emotions, thou…
☆11Dec 13, 2023Updated 2 years ago
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
raunak-agarwal / instruction-datasets
View on GitHub
Datasets for Instruction Tuning of Large Language Models
☆261Nov 30, 2023Updated 2 years ago