Toloka / WSDMCup2023Links

Toloka Visual Question Answering Challenge at WSDM Cup 2023

☆31

Alternatives and similar repositories for WSDMCup2023

Users that are interested in WSDMCup2023 are comparing it to the libraries listed below

Sorting:

YuanEZhou / satic
☆26Updated 4 years ago
lucidrains / multistream-transformers
Implementation of Multistream Transformers in Pytorch
☆54Updated 4 years ago
Kirill-Kravtsov / drophead-pytorch
An implementation of drophead regularization for pytorch transformers
☆19Updated 3 years ago
lucidrains / AoA-pytorch
A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
☆43Updated 4 years ago
davidsvy / cosformer-pytorch
Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".
☆44Updated 3 years ago
gchhablani / multilingual-vqa
Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.
☆34Updated 4 years ago
LooperXX / ManagerTower
Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
☆11Updated 7 months ago
facebookresearch / SIMAT
codebase for the SIMAT dataset and evaluation
☆38Updated 3 years ago
jaketae / fnet
PyTorch implementation of FNet: Mixing Tokens with Fourier transforms
☆27Updated 4 years ago
mshukor / eP-ALM
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Updated last year
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14Updated 2 years ago
jeykigung / HiCLIP
☆29Updated 2 years ago
facebookresearch / data2vec_vision
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆78Updated 3 years ago
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39Updated 3 years ago
VITA-Group / layerGraftedPretraining_ICLR23
[ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…
☆24Updated 2 years ago
researchmm / generate-it
A collection of models for image<->text generation in ACM MM 2021.
☆66Updated 3 years ago
lucidrains / cross-transformers-pytorch
Implementation of Cross Transformer for spatially-aware few-shot transfer, in Pytorch
☆53Updated 4 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
leaderj1001 / Synthesizer-Rethinking-Self-Attention-Transformer-Models
Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch
☆70Updated 5 years ago
Alibaba-MIIL / ZS_SDL
Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper
☆30Updated 2 years ago
usydnlp / vdoc
☆15Updated 2 years ago
goel-shashank / CyCLIP
☆120Updated 2 years ago
jonkahana / CLIPPR
An official PyTorch implementation for CLIPPR
☆29Updated 2 years ago
zinengtang / Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
☆33Updated 2 years ago
joaanna / disentangling_spelling_in_clip
☆34Updated 2 years ago
allenai / grit_official
Official repository for the General Robust Image Task (GRIT) Benchmark
☆54Updated 2 years ago
lucidrains / omninet-pytorch
Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch
☆58Updated 4 years ago
gchhablani / multilingual-image-captioning
☆44Updated 4 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Updated 3 years ago
google-research / fnc
☆28Updated 3 years ago