SuperBruceJia / Awesome-Large-Vision-Language-Model
Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
☆39Updated 5 months ago
Alternatives and similar repositories for Awesome-Large-Vision-Language-Model
Users who are interested in Awesome-Large-Vision-Language-Model are comparing it to the repositories listed below
- Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning / Fine-Tuning☆201Updated last year
- Reading list for Multimodal Large Language Models☆69Updated 2 years ago
- ☆135Updated 9 months ago
- Awesome Mixture of Experts (MoE): A Curated List of Mixture of Experts (MoE) and Mixture of Multimodal Experts (MoME)☆51Updated 2 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆104Updated 8 months ago
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆82Updated 6 months ago
- ☆77Updated 7 months ago
- Survey of Small Language Models from Penn State, ...☆229Updated last month
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆139Updated last year
- Collection of latest papers and materials in the area of RLVR!☆45Updated last month
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆136Updated 8 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆63Updated 7 months ago
- ☆33Updated 11 months ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) 🍓☆35Updated 8 months ago
- A regression-alike loss to improve numerical reasoning in language models - ICML 2025☆27Updated 4 months ago
- ☆97Updated last year
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆88Updated 2 years ago
- Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.☆132Updated this week
- A curated list of Large Language Model with RAG☆81Updated 2 years ago
- An open-source implementation for fine-tuning Llama3.2-Vision series by Meta.☆173Updated 2 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆99Updated last year
- ☆146Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 6 months ago
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆56Updated 2 years ago
- Official Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated last month
- [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C…☆274Updated 11 months ago
- ☆58Updated 9 months ago
- A curated list of awesome Multimodal studies.☆301Updated last week
- Reproduction of DeepSeek-R1☆242Updated 8 months ago
- ☆40Updated last year