Reading list for Multimodal Large Language Models
☆69Aug 17, 2023Updated 2 years ago
Alternatives and similar repositories for Awesome-Multimodal-LLM
Users that are interested in Awesome-Multimodal-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research Trends in LLM-guided Multimodal Learning.☆356Oct 17, 2023Updated 2 years ago
- A library for training crosscoders☆17May 28, 2025Updated 11 months ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆369Mar 19, 2025Updated last year
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- Official PyTorch Repository of "Minority-Oriented Vicinity Expansion with Attentive Aggregation for Video Long-Tailed Recognition" (AAAI …☆13Jul 27, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Apr 13, 2026Updated 2 weeks ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆76Oct 16, 2024Updated last year
- This is the official pytorch implementation of "Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation" (ECCV 2024).☆18Aug 7, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆30Oct 23, 2025Updated 6 months ago
- S2R-HDR: A Large-Scale Rendered Dataset for HDR Fusion☆17Feb 11, 2026Updated 2 months ago
- ☆19Feb 12, 2025Updated last year
- ☆55Apr 1, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- 14 million, semi-supervised, mental disorder detection data.☆15Oct 23, 2024Updated last year
- [CVPR 2024] Diversity-aware Channel Pruning for StyleGAN Compression☆25Jul 23, 2025Updated 9 months ago
- Recent Mexican Election Vote Returns☆11Apr 18, 2026Updated last week
- Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.☆757Apr 6, 2026Updated 3 weeks ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",…☆25Oct 22, 2023Updated 2 years ago
- Repository of the paper "Reconstruction of Time-Varying Graph Signals via Sobolev Smoothness" published in IEEE T-SIPN☆13Mar 3, 2022Updated 4 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CLI to convert Scrapbox page to Markdown☆12Dec 4, 2025Updated 4 months ago
- 💬A curated list of incredible amount of publications related to Dialogue Systems especially Chatbots and Chit-chat Systems☆10Dec 5, 2019Updated 6 years ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- Gradio demo used in our Osprey:Pixel Understanding with Visual Instruction Tuning.☆16Dec 19, 2023Updated 2 years ago
- Gender prediction of chinese name based on LSTM☆14Mar 16, 2023Updated 3 years ago
- Re-implementation of the published ECCV '20 paper on reciprocal points for open-set recognition. Currently the state-of-the-art in open-s …☆26Feb 12, 2021Updated 5 years ago
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Corpus analyses confrontation☆21Jan 24, 2023Updated 3 years ago
- This repository stores the proposals submitted to the NumFOCUS Small Development Grants (SDG) program.☆19Nov 18, 2025Updated 5 months ago
- [EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation☆13Dec 4, 2023Updated 2 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- Official PyTorch Repository of "Tailoring Self-Supervision for Supervised Learning" (ECCV 2022 Paper)☆30Jul 28, 2023Updated 2 years ago
- ☆20Jul 11, 2023Updated 2 years ago
- A web application for visualizing the results of social science survey data.☆12May 22, 2020Updated 5 years ago