NKU-MetautoAI / awesome-large-vision-language-modelsView external linksLinks
Advances in recent large vision language models (LVLMs)
☆15Sep 23, 2024Updated last year
Alternatives and similar repositories for awesome-large-vision-language-models
Users that are interested in awesome-large-vision-language-models are comparing it to the libraries listed below
Sorting:
- ☆17Oct 1, 2024Updated last year
- Python大作业虚假新闻检测☆27Jan 4, 2025Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆33Jul 16, 2025Updated 6 months ago
- NKU_2022Fall Python language programming project. 虚假新闻检测项目,分别使用机器学习、深度学习和bert方法完成任务☆34Nov 24, 2024Updated last year
- GCAGC, CVPR2020, GCAGC-Inst, TMM2021. Adaptive Graph Convolutional Network with Attention Graph Clustering for Co-saliency Detection☆38Jul 27, 2021Updated 4 years ago
- Unity 结合AIGC 虚拟仿真人项目☆11Feb 21, 2025Updated 11 months ago
- Predicting emotions on Android☆11Nov 26, 2020Updated 5 years ago
- [ICCV2025] Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning☆23Nov 13, 2025Updated 3 months ago
- [MICCAI 2023] Official implementation of our MICCAI 2023 paper "Pick the Best Pre-trained Model: Towards Transferability Estimation for M…☆13Jul 27, 2023Updated 2 years ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 10 months ago
- Light Field Super-Resolution Network Using Joint Spatio-Angular and Epipolar Information☆10May 31, 2023Updated 2 years ago
- Detect wildfires using ML on images from cameras on vantage points☆11Oct 16, 2024Updated last year
- ☆11Feb 28, 2024Updated last year
- Official repository for 'Risk of Bias in Chest Radiography Deep Learning Foundation Models'☆12Sep 27, 2023Updated 2 years ago
- Skin3D: Detection and Longitudinal Tracking of Pigmented Skin Lesions in 3D Total-Body Textured Meshes☆10Aug 31, 2025Updated 5 months ago
- [IEEE RA-L 2025] The official repository for Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Reco…☆58Jun 2, 2025Updated 8 months ago
- ☆49Oct 11, 2021Updated 4 years ago
- Source code for "MEDIMP: 3D Medical Images with clinical Prompts from limited tabular data for renal transplantation", MIDL 2023, https:/…☆10Apr 29, 2023Updated 2 years ago
- ☆13Mar 15, 2024Updated last year
- Code of Decomposition and Completion Network for Salient Object Detection, TIP 2021.☆10Mar 30, 2023Updated 2 years ago
- 南开大学操作系统课程实验(UCore)☆11Oct 16, 2022Updated 3 years ago
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated 10 months ago
- ☆12Nov 22, 2022Updated 3 years ago
- Ensemble Learning of Foundation Models☆17Aug 29, 2025Updated 5 months ago
- The medical imaging meta-learning toolbox allows to build models that learn to learn in a setting with diverse tasks. It also provides co…☆44May 9, 2024Updated last year
- [MICCAI‘25 Early Accept] MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment☆15Nov 15, 2025Updated 3 months ago
- GHUStereo models are novel real-time stereo matching architectures with a low computation complexity characterized by compact cost volum…☆29Dec 14, 2025Updated 2 months ago
- [TMM 2023] Official Implementation of "Bidirectional Translation Between UHD-HDR and HD-SDR Videos"☆10Aug 8, 2024Updated last year
- Code for A Dual Domain Multi-exposure Image Fusion Network Based on the Spatial-frequency Integration.☆12Jul 25, 2024Updated last year
- Create reliability diagrams to quantify ML calibration.☆10Feb 1, 2022Updated 4 years ago
- An efficent (O(1)) algorithm to extract numbers from a non-uniform discrete probability distribution☆10Jun 19, 2019Updated 6 years ago
- Implementation of the spotlight: a method for discovering systematic errors in deep learning models☆11Oct 5, 2021Updated 4 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- Building deployable wildfire smoke detection models☆10Nov 5, 2020Updated 5 years ago
- A holistic framework for advancing LLMs as data science agents☆30Feb 3, 2026Updated last week
- 基于Dear -imgui 的软光栅化渲染器 (学习项目)☆10Feb 21, 2025Updated 11 months ago
- Code related to the paper "MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion"☆12Dec 14, 2024Updated last year
- This is a PyTorch/GPU implementation of the Information Fusion 2022 paper: Rethinking multi-exposure image fusion with extreme and divers…☆14Sep 4, 2023Updated 2 years ago
- Codes of “Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification”☆11Aug 9, 2022Updated 3 years ago