vincentlux / Awesome-Multimodal-LLMView external linksLinks
Reading list for Multimodal Large Language Models
☆69Aug 17, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-Multimodal-LLM
Users that are interested in Awesome-Multimodal-LLM are comparing it to the libraries listed below
Sorting:
- Research Trends in LLM-guided Multimodal Learning.☆357Oct 17, 2023Updated 2 years ago
- Python scripts for setting up private LLM's on local and in the cloud with LangChain, GPT4All and Cerebrium☆11May 29, 2023Updated 2 years ago
- ☆55Apr 1, 2024Updated last year
- [ACMMM2025] Official released code for ALLM4ADD☆36Oct 30, 2025Updated 3 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated last year
- ☆101Dec 22, 2023Updated 2 years ago
- List of reference,algorithms, applications in SSL in RS (contribution are welcome)☆18May 1, 2023Updated 2 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆72Oct 16, 2024Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Jan 17, 2026Updated 3 weeks ago
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…☆25Oct 22, 2023Updated 2 years ago
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆35Nov 29, 2024Updated last year
- rmp data ranking☆13Nov 4, 2025Updated 3 months ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆529Apr 8, 2024Updated last year
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆28Jan 28, 2025Updated last year
- Recent Mexican Election Vote Returns☆10Feb 9, 2026Updated last week
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated last year
- An Easy-to-use Hallucination Detection Framework for LLMs.☆63Apr 21, 2024Updated last year
- ☆32Mar 25, 2024Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆281Jun 25, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- A Chat GUI for local Alpaca language model with guided download and install instructions☆34Apr 24, 2023Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- Watch Youtube videos unblocked, unrestricted, with no ads and history hiding☆15Jan 31, 2026Updated 2 weeks ago
- A RLHF Infrastructure for Vision-Language Models☆196Nov 15, 2024Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆35Jul 2, 2024Updated last year
- [ICASSP 2024] The official repo for Harnessing the Power of Large Vision Language Models for Synthetic Image Detection☆34Aug 13, 2025Updated 6 months ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- 学习tensorflow☆34Sep 21, 2020Updated 5 years ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆306Sep 11, 2024Updated last year
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆72Apr 15, 2025Updated 10 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆35Aug 2, 2023Updated 2 years ago
- ☆37Dec 6, 2024Updated last year
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆336Jul 17, 2024Updated last year
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- Panorama_498全景图像数据集☆14Apr 8, 2022Updated 3 years ago