yonatanbitton/wysiwyr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yonatanbitton/wysiwyr)

yonatanbitton / wysiwyr

☆37

Alternatives and similar repositories for wysiwyr

Users that are interested in wysiwyr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

princetonvisualai / pointingqa
View on GitHub
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19Oct 4, 2022Updated 3 years ago
archiki / RepARe
View on GitHub
☆21Oct 10, 2023Updated 2 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
linzhiqiu / visual_gpt_score
View on GitHub
VisualGPTScore for visio-linguistic reasoning
☆27Oct 7, 2023Updated 2 years ago
LCS2-IIITD / DABERTA-EMNLP-2022
View on GitHub
☆11Apr 4, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
google-research-datasets / maverics
View on GitHub
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…
☆13Feb 18, 2023Updated 3 years ago
WangFei-2019 / SNARE
View on GitHub
Project for SNARE benchmark
☆11Jun 5, 2024Updated 2 years ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
VickiCui / MORE
View on GitHub
Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"
☆11Oct 11, 2024Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
Victorwz / VaLM
View on GitHub
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Mar 6, 2023Updated 3 years ago
mshukor / ViCHA
View on GitHub
[BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"
☆54Oct 20, 2022Updated 3 years ago
SivanDoveh / DAC
View on GitHub
Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models
☆28Nov 29, 2023Updated 2 years ago
google-research-datasets / 2.5vrd
View on GitHub
This dataset contains about 110k images annotated with the depth and occlusion relationships between arbitrary objects. It enables resear…
☆16Apr 28, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mlfoundations / imagenet-captions
View on GitHub
Release of ImageNet-Captions
☆51Jan 20, 2023Updated 3 years ago
vsahil / MIMETIC-2
View on GitHub
Official Code for MIMETIC^2
☆13Nov 19, 2024Updated last year
jimmyxu123 / SELECT
View on GitHub
This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"
☆16Oct 8, 2024Updated last year
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
McGill-NLP / diffusion-itm
View on GitHub
Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"
☆33Mar 15, 2024Updated 2 years ago
ugorsahin / Generative-Negative-Mining
View on GitHub
[WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024
☆13Jan 3, 2024Updated 2 years ago
hassyGo / pytorch-playground
View on GitHub
My PyTorch playground for NLP
☆13Sep 20, 2018Updated 7 years ago
Letian2003 / C-VQA
View on GitHub
Counterfactual Reasoning VQA Dataset
☆28Nov 23, 2023Updated 2 years ago
VityaSchel / vfs-status-bot
View on GitHub
Read-only mirror of https://git.hloth.dev/hloth/vfs-status-bot
☆12Jul 14, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
HYPJUDY / Sparkles
View on GitHub
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
☆45Jun 14, 2024Updated 2 years ago
zhuang-li / FactualSceneGraph
View on GitHub
[ACL 2023 Findings] FACTUAL dataset, the textual scene graph parser trained on FACTUAL.
☆131Jun 15, 2026Updated last month
google-research / pactran_metrics
View on GitHub
☆14Mar 24, 2023Updated 3 years ago
umd-huang-lab / Mementos
View on GitHub
☆32Feb 8, 2024Updated 2 years ago
mlfoundations / VisIT-Bench
View on GitHub
☆51Oct 29, 2023Updated 2 years ago
jiyounglee-0523 / VisAlign
View on GitHub
☆20Apr 23, 2024Updated 2 years ago
ggjy / vision_weak_to_strong
View on GitHub
☆38Feb 8, 2024Updated 2 years ago
linzhiqiu / CLIP-FlanT5
View on GitHub
Training code for CLIP-FlanT5
☆31Jul 29, 2024Updated last year
ajd12342 / why-winoground-hard
View on GitHub
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
☆31May 29, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AILab-CVC / SEED-Bench
View on GitHub
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
☆366Jan 14, 2025Updated last year
google-deepmind / svo_probes
View on GitHub
The SVO-Probes Dataset for Verb Understanding
☆29Jan 28, 2022Updated 4 years ago
muliyangm / ComEx
View on GitHub
Code for ComEx [CVPR 2022]
☆12Dec 5, 2022Updated 3 years ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
open-vision-language / oven
View on GitHub
☆47Aug 15, 2023Updated 2 years ago
elisakreiss / concadia
View on GitHub
☆16Jan 3, 2023Updated 3 years ago