research work on multimodal cognitive ai
☆68Mar 3, 2026Updated this week
Alternatives and similar repositories for multimodal_cognitive_ai
Users that are interested in multimodal_cognitive_ai are comparing it to the libraries listed below
Sorting:
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Mar 28, 2023Updated 2 years ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 7 months ago
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆10Mar 1, 2025Updated last year
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 5 months ago
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".☆13Jun 24, 2024Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated last year
- ☆16Sep 6, 2024Updated last year
- PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images☆16Dec 4, 2024Updated last year
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- This packages provides a simple python implementation of Invariant Causal Prediction (ICP)☆13Mar 22, 2024Updated last year
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆38Aug 19, 2023Updated 2 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆14Jul 26, 2023Updated 2 years ago
- Official implementation for "Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training" https://arxiv.org/abs/…☆76Dec 25, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- [ICML2023] InfoOT: Information Maximizing Optimal Transport☆41Apr 27, 2023Updated 2 years ago
- [ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks☆14May 2, 2025Updated 10 months ago
- CatMAE☆14Dec 13, 2023Updated 2 years ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Jul 14, 2023Updated 2 years ago
- Code for ICML2023 paper, DDGR: Continual Learning with Deep Diffusion-based Generative Replay.☆39Aug 21, 2023Updated 2 years ago
- Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)☆71Jan 19, 2024Updated 2 years ago
- Frequency Shortcuts in Neural Networks☆21Nov 1, 2024Updated last year
- This is the official repository for the IEEE TMI paper titled "Large Language Model with Region-Guided Referring and Grounding for CT Rep…☆68Jun 28, 2025Updated 8 months ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆24Mar 17, 2025Updated 11 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated last year
- [TMM] MINT-IQA: Quality Assessment for AI Generated Images with Instruction Tuning☆20Nov 21, 2025Updated 3 months ago
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Texual Inversion, LoRA, Custom Diffusion…☆84Oct 6, 2024Updated last year
- Source codes of the paper "Hierarchical Pretraining on Multimodal Electronic Health Records".☆20Apr 10, 2024Updated last year
- ☆25Nov 30, 2023Updated 2 years ago
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆23Jan 5, 2026Updated 2 months ago
- Official PyTorch implementation of paper: "Revisiting the Importance of Amplifying Bias for Debiasing" (AAAI 2023)☆19Dec 30, 2022Updated 3 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023)☆21Jun 27, 2023Updated 2 years ago
- The code and pre-trained models of the paper "Masked Autoencoders as Image Processors" will be released in this repository.☆22Mar 31, 2023Updated 2 years ago
- The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".☆18May 10, 2023Updated 2 years ago
- Code for EDLCV 2020 paper "Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Spa…☆20Apr 18, 2020Updated 5 years ago
- [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs☆157Jul 23, 2024Updated last year