Awesome multi-modal large language paper/project, collections of popular training strategies, e.g., PEFT, LoRA.
☆27Aug 2, 2024Updated last year
Alternatives and similar repositories for Multi-Modal-Large-Language-Learning
Users that are interested in Multi-Modal-Large-Language-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated 11 months ago
- ☆11Jul 11, 2023Updated 2 years ago
- [TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”☆46Jan 27, 2024Updated 2 years ago
- [IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”☆87Aug 14, 2024Updated last year
- This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.☆18Aug 6, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.☆19May 25, 2023Updated 2 years ago
- [TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"☆15May 12, 2025Updated 11 months ago
- Talk to ChatGPT and Generate image via any Matrix client!☆16Apr 25, 2023Updated 3 years ago
- ☆15Aug 4, 2020Updated 5 years ago
- Repository of CVPR'22 paper "Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression"☆117Aug 5, 2024Updated last year
- Repository for an end-to-end image captioning method PTSN(ACM MM22).☆60Dec 11, 2022Updated 3 years ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆38Mar 9, 2025Updated last year
- ChineseCLIP using online learning☆14Nov 7, 2022Updated 3 years ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆12Sep 21, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2023]Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"☆61Jun 8, 2023Updated 2 years ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- ☆32Dec 6, 2025Updated 4 months ago
- The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.☆15Dec 25, 2023Updated 2 years ago
- The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World …☆48Apr 19, 2026Updated last week
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- ☆33Sep 11, 2025Updated 7 months ago
- ☆32Dec 14, 2025Updated 4 months ago
- Official pytorch implementation of CVPR2023 paper "Learning Conditional Attributes for Compositional Zero-Shot Learning"☆18Oct 19, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆23Sep 9, 2025Updated 7 months ago
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆50Feb 2, 2026Updated 3 months ago
- Xiwu: A Large Lanauge Model for High Energy Physics☆21Jan 20, 2025Updated last year
- This repository contains code and dataset splits for the paper "Classification by Attention: Scene Graph Classification with Prior Knowle…☆16May 27, 2022Updated 3 years ago
- Study materials about "Deep Learning for Molecular Applications".☆15Aug 5, 2019Updated 6 years ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆51Jul 1, 2025Updated 10 months ago
- [NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models☆26Oct 29, 2025Updated 6 months ago
- A list of papers in NeurIPS 2022 related to adversarial attack and defense / AI security.☆76Dec 5, 2022Updated 3 years ago
- Python implementation for paper: Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples☆11Jun 12, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Official implementation of the CVPR '25 highlight paper "Compositional Caching for Training-free Open-vocabulary Attribute Detection"☆24Dec 23, 2024Updated last year
- ☆12Mar 24, 2023Updated 3 years ago
- Google Gemini Voice/Vision Assistant with gemini-1.5-pro / gemini-1.5-flash modal ! #Gemini 1.5 Flash #Gemini 1.5 Pro☆11May 18, 2024Updated last year
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆13Aug 9, 2023Updated 2 years ago
- ☆12Jan 21, 2019Updated 7 years ago