jiasenlu / unified-io

☆31

Related projects: ⓘ

easonnie / mlp-vil
MLPs for Vision and Langauge Modeling (Coming Soon)
☆27Updated 2 years ago
NVlabs / RelViT
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
☆63Updated 2 years ago
allenai / gpv2
☆32Updated 2 years ago
zfchenUnique / compositional_physics_learner
☆36Updated 2 years ago
zinengtang / Perceiver_VL
PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)
☆32Updated last year
antoine77340 / RareAct
RareAct: A video dataset of unusual interactions
☆32Updated 4 years ago
princetonvisualai / pointingqa
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆18Updated last year
princeton-vl / Rel3D
Official code for NeurRIPS 2020 paper "Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D"
☆26Updated last year
yuweihao / LV-BERT
LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)
☆18Updated last year
redcaps-dataset / redcaps-downloader
Command-line tool for downloading and extending the RedCaps dataset.
☆45Updated 9 months ago
karazijal / clevrtex-generation
☆39Updated 7 months ago
facebookresearch / GDT
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…
☆44Updated 3 years ago
zfchenUnique / DCL-Release
This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).
☆37Updated 2 months ago
StanLei52 / TQVSR
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆23Updated last year
KMnP / nn-revisit
Rethinking Nearest Neighbors for Visual Classification
☆31Updated 2 years ago
guilk / VLC
Research code for "Training Vision-Language Transformers from Captions Alone"
☆34Updated 2 years ago
mshukor / EvALign-ICL
[ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …
☆20Updated 6 months ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆17Updated 3 years ago
16lemoing / ccvs
CCVS: Context-aware Controllable Video Synthesis
☆21Updated 2 years ago
soCzech / LookForTheChange
Code for Look for the Change paper published at CVPR 2022
☆35Updated last year
airsplay / vimpac
☆74Updated 2 years ago
evelinehong / PTR
Official Repository of NeurIPS2021 paper: PTR
☆33Updated 2 years ago
evelinehong / VLGrammar
☆29Updated 2 years ago
minyoungg / overparam
☆40Updated last year
AllanYangZhou / generative-invariance-transfer
☆26Updated 2 years ago
donglixp / ICL_PaperList
Paper List for In-context Learning 🌷
☆20Updated last year
Jiahao000 / ORL
[NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images
☆58Updated 2 years ago
google-research / si-score
☆22Updated last month
zinengtang / DeCEMBERT
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Updated last year
Lizw14 / Super-CLEVR
Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"
☆20Updated last year