Reading list for Multimodal Large Language Models
☆70Aug 17, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-Multimodal-LLM
Users that are interested in Awesome-Multimodal-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research Trends in LLM-guided Multimodal Learning.☆356Oct 17, 2023Updated 2 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- ☆16May 30, 2025Updated 11 months ago
- [AAAI 2023] Official repository of "Progressive Few-Shot Adaptation of Generative Model with Align-Free Spatial Correlation"☆10Jul 4, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27May 11, 2026Updated last week
- ☆102Dec 22, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆76Oct 16, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆31Oct 23, 2025Updated 6 months ago
- ☆55Apr 1, 2024Updated 2 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- [CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression☆25Jul 23, 2025Updated 9 months ago
- Recent Mexican Election Vote Returns☆11Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.☆758Apr 6, 2026Updated last month
- Neuron Activation☆27Nov 21, 2024Updated last year
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…☆25Oct 22, 2023Updated 2 years ago
- ICCV 2021☆34May 11, 2022Updated 4 years ago
- Repository of the paper "Reconstruction of Time-Varying Graph Signals via Sobolev Smoothness" published in IEEE T-SIPN☆13Mar 3, 2022Updated 4 years ago
- paddle code convert toolkit☆22Mar 19, 2023Updated 3 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- CLI to convert Scrapbox page to Markdown☆12Dec 4, 2025Updated 5 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Gender prediction of chinese name based on LSTM☆14Mar 16, 2023Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- ☆21Nov 27, 2025Updated 5 months ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- Implemention based on lightrag and nano-graphrag to connect with psql☆15Oct 28, 2024Updated last year
- Awesome List of Vision Language Prompt Papers☆48Nov 9, 2023Updated 2 years ago
- Corpus analyses confrontation☆21Jan 24, 2023Updated 3 years ago
- This repository stores the proposals submitted to the NumFOCUS Small Development Grants (SDG) program.☆19Nov 18, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Feb 7, 2023Updated 3 years ago
- Official PyTorch Repository of "Tailoring Self-Supervision for Supervised Learning" (ECCV 2022 Paper)☆30Jul 28, 2023Updated 2 years ago
- Latest Advances on Multimodal Large Language Models☆17,795May 1, 2026Updated 2 weeks ago
- GitHack is a social gamification platform for GitHub☆15Oct 8, 2014Updated 11 years ago
- Explaining neural decisions contrastively to alternative decisions.☆24Mar 18, 2021Updated 5 years ago
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆37Nov 29, 2024Updated last year
- Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"☆29Mar 7, 2023Updated 3 years ago