research work on multimodal cognitive ai
☆68Mar 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for multimodal_cognitive_ai
Users that are interested in multimodal_cognitive_ai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning☆16Feb 17, 2025Updated last year
- Code Repository for CausalDiffAE (ECAI 2024)☆20Oct 19, 2024Updated last year
- Code for EDLCV 2020 paper "Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Spa…☆21Apr 18, 2020Updated 5 years ago
- SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models☆21Jan 11, 2024Updated 2 years ago
- This packages provides a simple python implementation of Invariant Causal Prediction (ICP)☆13Mar 22, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 6 months ago
- PKU-I2IQA: An Image-to-Image Quality Assessment Database for AI Generated Images☆16Dec 4, 2024Updated last year
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…☆11Jul 28, 2025Updated 8 months ago
- Bookmarklet to pull and run hugging face GGUF models in Ollama☆18Oct 17, 2024Updated last year
- Repository of paper Consistency-preserving Visual Question Answering in Medical Imaging (MICCAI2022)☆25Mar 28, 2023Updated 3 years ago
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- ☆16Sep 6, 2024Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)☆13Apr 17, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official code for the CVPR 2024 Paper "Can Biases in ImageNet Models Explain Generalization?".☆13Jun 24, 2024Updated last year
- ☆18Sep 13, 2023Updated 2 years ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- The reproduce of paper "Continual Vision-Language Representation Learning with Off-Diagonal Information ".(Mod-X)☆11Oct 31, 2023Updated 2 years ago
- Official pytorch implementation for SingleInsert☆28Apr 19, 2024Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20May 27, 2024Updated last year
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆37Jan 25, 2024Updated 2 years ago
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆30Aug 2, 2022Updated 3 years ago
- A Python API for accessing the music, metadata, and associated images from www.bensound.com.☆10Aug 12, 2020Updated 5 years ago
- ☆16Apr 21, 2025Updated 11 months ago
- Radiology Language Evaluations☆11Nov 17, 2023Updated 2 years ago
- [ICML2023] InfoOT: Information Maximizing Optimal Transport☆41Apr 27, 2023Updated 2 years ago
- [ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoing and Understanding.☆71Feb 28, 2026Updated last month
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆131Jul 24, 2025Updated 8 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023☆294Jul 14, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆24Mar 17, 2025Updated last year
- [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs☆157Jul 23, 2024Updated last year
- This is a community implementation for the paper EcoTTA: Memory-Efficient Continual Test-time Adaptation via Self-distilled Regularizatio…☆37Aug 4, 2023Updated 2 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆14Jul 26, 2023Updated 2 years ago
- Code to reproduce the experimental results from the paper "Active Invariant Causal Prediction: Experiment Selection Through Stability", b…☆21Jul 6, 2023Updated 2 years ago
- Path to Medical AGI: Unify Domain-specific Medical LLMs with the Lowest Cost☆39Jun 21, 2023Updated 2 years ago
- [TMM] MINT-IQA: Quality Assessment for AI Generated Images with Instruction Tuning☆20Nov 21, 2025Updated 4 months ago