ChenyuHeidiZhang/VL-commonsense

☆14

Alternatives and similar repositories for VL-commonsense

Users that are interested in VL-commonsense are comparing it to the libraries listed below.

xxxiaol / spatial-commonsense
Source code and data for Things not Written in Text: Exploring Spatial Commonsense from Visual Signals (ACL2022 main conference paper).
☆20 · Oct 10, 2022 · Updated 3 years ago
TobiasLee / VEC
Visual and Embodied Concepts evaluation benchmark
☆21 · Oct 10, 2023 · Updated 2 years ago
Victorwz / VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56 · Mar 6, 2023 · Updated 3 years ago
zinengtang / VidLanKD
PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)
☆56 · Feb 6, 2023 · Updated 3 years ago
liujch1998 / vera
☆17 · May 23, 2023 · Updated 3 years ago
adobe-research / llava-score
☆11 · Oct 2, 2024 · Updated last year
MAmmoTH-VL / MAmmoTH-VL
(ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
☆50 · Jun 4, 2025 · Updated last year
ampersandmcd / DeepExtremeMixtureModel
Official code release for Deep Extreme Mixture Model by Wilson, McDonald, Galib, Tan, and Luo.
☆10 · Feb 11, 2022 · Updated 4 years ago
nlpapereading / nlpapereading
☆58 · Sep 23, 2022 · Updated 3 years ago
EdinburghNLP / spot-data
Sentiment polarity annotations dataset
☆26 · Nov 28, 2017 · Updated 8 years ago
gchhablani / multilingual-vqa
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆33 · Jul 27, 2021 · Updated 5 years ago
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆33 · May 16, 2024 · Updated 2 years ago
Victorwz / LaViA
☆10 · Jul 13, 2024 · Updated 2 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18 · Sep 17, 2021 · Updated 4 years ago
lscpku / VITATECS
☆18 · Jul 10, 2024 · Updated 2 years ago
MME-Benchmarks / MME-Unify
✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
☆43 · Apr 10, 2025 · Updated last year
aiueola / wsdm2022-cascade-dr
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
☆13 · Jul 16, 2023 · Updated 3 years ago
ademakdogan / plant_detector
PlantDetector provides easy development (training and prediction) for object detection. DETR (End-to-End Object Detection with Transforme…
☆11 · Aug 1, 2022 · Updated 3 years ago
hughplay / memo
📝 Anything for coding faster and more comfortable.
☆13 · Jan 21, 2026 · Updated 6 months ago
xguo7 / Automatic-Controllable-Product-Copywriting-for-E-Commerce
☆16 · Nov 3, 2022 · Updated 3 years ago
YilunZhou / path-naturalness-prediction
Code repository for the WWW 2019 paper "Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness"
☆12 · Feb 1, 2019 · Updated 7 years ago
THUNLP-MT / ActiView
☆11 · Dec 20, 2024 · Updated last year
duyichao / NPDA-KNN-ST
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11 · Oct 26, 2022 · Updated 3 years ago
iesl / ProtoQA_GPT2
This is the GPT2 baseline for ProtoQA
☆12 · Jan 3, 2022 · Updated 4 years ago
eval4nlp / SharedTask2023
☆11 · Jul 6, 2024 · Updated 2 years ago
galaxyproject / gravity
Galaxy process management and system administration tools
☆14 · Updated this week
google-deepmind / svo_probes
The SVO-Probes Dataset for Verb Understanding
☆29 · Jan 28, 2022 · Updated 4 years ago
Victorwz / tod_as_nlg
Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".
☆14 · Apr 6, 2022 · Updated 4 years ago
ruyimarone / data-portraits
Documenting large text datasets 🖼️ 📚
☆14 · Dec 17, 2024 · Updated last year
skandavivek / web-qa
☆11 · Feb 25, 2024 · Updated 2 years ago
llyx97 / sparse-and-robust-PLM
[NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…
☆21 · Jan 9, 2024 · Updated 2 years ago
saibr / hypvl
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21 · Jul 5, 2024 · Updated 2 years ago
SpeechEE / SpeechEE
☆11 · Aug 20, 2025 · Updated 11 months ago
hughplay / DeepCodebase
A template for deep learning projects.
☆16 · May 7, 2025 · Updated last year
DoubtedSteam / MM-GCoT
The official implementation of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆22 · Jul 21, 2025 · Updated last year
mengzaiqiao / awesome-natural-language-reasoning
A collection of research papers related to Natural Language Reasoning
☆10 · May 27, 2022 · Updated 4 years ago
jalayrac / instructionVideos
Code for the paper "Unsupervised Learning from Narrated Instruction Videos", CVPR2016
☆20 · Jul 27, 2016 · Updated 10 years ago
LCO-Embedding / LCO-Embedding
[NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning
☆48 · Apr 13, 2026 · Updated 3 months ago
Victorwz / zs-nmt-dae
Official implementation of EMNLP 2021 Paper "Rethinking Zero-shot Neural Machine Translation: From a Perspective of Latent Variables"
☆12 · May 15, 2023 · Updated 3 years ago