UMass-Embodied-AGI / genome
☆15Updated last year
Alternatives and similar repositories for genome:
Users that are interested in genome are comparing it to the libraries listed below
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆36Updated last year
- ☆38Updated 2 years ago
- ☆42Updated 10 months ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆48Updated last month
- General-purpose Visual Understanding Evaluation☆20Updated last year
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆33Updated last year
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆18Updated 2 years ago
- [TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.☆116Updated last year
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022☆30Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆30Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆43Updated last year
- Official Code for Neural Systematic Binder☆32Updated last year
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆21Updated last year
- ☆67Updated last year
- ☆68Updated 3 months ago
- ☆41Updated last year
- Recursive Visual Programming (ECCV 2024)☆17Updated 4 months ago
- Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)☆33Updated last year
- ☆29Updated 9 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37Updated last year
- Differentiable First-Order Logic Reasoning for Visual Question Answering☆39Updated 4 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆37Updated 9 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆32Updated last year
- ☆45Updated 11 months ago
- ☆32Updated 3 years ago
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language☆46Updated last year
- ☆26Updated 3 years ago