CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
☆29Oct 27, 2023Updated 2 years ago
Alternatives and similar repositories for CLEVR-X
Users that are interested in CLEVR-X are comparing it to the libraries listed below
Sorting:
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated 11 months ago
- Official implementation for the CVPR 2024 paper CAMEL☆20Jun 20, 2024Updated last year
- ☆19Apr 1, 2025Updated 11 months ago
- Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…☆10Jun 16, 2024Updated last year
- Code base for paper "Finding Structural Knowledge in Multimodal-BERT". Framework for probing and code for creating Scene Trees.☆10May 19, 2022Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Aug 16, 2022Updated 3 years ago
- TACO: TFBS-Aware Cis-Regulatory Element Optimization☆21Aug 1, 2025Updated 7 months ago
- ☆16Dec 25, 2021Updated 4 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆18Nov 30, 2024Updated last year
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Jul 17, 2023Updated 2 years ago
- ☆29Mar 30, 2025Updated 11 months ago
- ☆20Oct 21, 2022Updated 3 years ago
- [CIKM2023] The official implementation of "MPerformer: An SE(3) Transformer-based Molecular Perceptron"☆28Nov 12, 2024Updated last year
- Official implementation of the "Multimodal Parameter-Efficient Few-Shot Class Incremental Learning" paper☆24Apr 18, 2024Updated last year
- A simple wrapper for lmdb. Support dict-like operations.☆23Apr 20, 2023Updated 2 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 4 years ago
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…☆29Dec 2, 2024Updated last year
- [AAAI'25] MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation☆35Oct 26, 2025Updated 4 months ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- The PyTorch implementation of MoMu, described in "Natural Language-informed Modeling of Molecule Graphs".☆29Jul 17, 2023Updated 2 years ago
- Official implementation of the paper "MotionCrafter: One-Shot Motion Customization of Diffusion Models"☆28Jan 4, 2024Updated 2 years ago
- 板球控制系統/滾球系統/BallPlate 2017年全国大学生电子设计竞赛B题 全国二等奖作品☆10May 27, 2024Updated last year
- DMRM: A Dual-channel Multi-hop Reasoning Model for Visual Dialog☆25Mar 8, 2022Updated 3 years ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Dec 19, 2024Updated last year
- ☆12Apr 14, 2025Updated 10 months ago
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆42Nov 15, 2024Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆14Jul 31, 2025Updated 7 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- A CSS3 Overlay system for modal dialogs.☆66Dec 16, 2010Updated 15 years ago
- Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries☆10Dec 28, 2018Updated 7 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i…☆42Aug 26, 2022Updated 3 years ago
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Oct 24, 2024Updated last year
- ☆40Nov 23, 2022Updated 3 years ago
- VQACL: A Novel Visual Question Answering Continual Learning Setting (CVPR'23)☆44Mar 28, 2024Updated last year