jinbo0906 / Awesome-MLLM-Datasets
This project aims to collect and collate datasets for training multimodal large models, including but not limited to pre-training data, instruction fine-tuning data, and in-context learning data.
☆68May 7, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-MLLM-Datasets
Users that are interested in Awesome-MLLM-Datasets are comparing it to the libraries listed below
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆40Jan 4, 2024Updated 2 years ago
- 🔨🔨🔨 Tool for building model training datasets☆20Nov 1, 2024Updated last year
- ☆11May 17, 2024Updated last year
- A multimodal large model for X-ray recognition, fine-tuned from LLaVA1.6☆10Oct 22, 2024Updated last year
- Multi-Task instruction-tuned LLaMA☆14May 5, 2023Updated 2 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆18May 14, 2024Updated last year
- ☆30Aug 21, 2025Updated 5 months ago
- ☆25Feb 2, 2025Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 9 months ago
- Awesome papers on multimodal LLMs with grounding ability☆19Oct 11, 2025Updated 4 months ago
- Web application for real-time object detection 🔎 using Flask 🌶, OpenCV, and YoloV3 weights. It uses the COCO Dataset 🖼.☆16Apr 19, 2021Updated 4 years ago
- Code implementation of Baichuan Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Aug 27, 2023Updated 2 years ago
- ☆20Jan 6, 2023Updated 3 years ago
- A Collection of Papers on Diffusion Language Models☆157Sep 15, 2025Updated 5 months ago
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models☆48Updated this week
- Export Donut model to onnx and run it with onnxruntime☆23Nov 21, 2023Updated 2 years ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated 11 months ago
- This is for ACL 2025 Findings Paper: From Specific-MLLMs to Omni-MLLMs: A Survey on MLLMs Aligned with Multi-modalities☆90Jan 3, 2026Updated last month
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆361Mar 19, 2025Updated 10 months ago
- ☆32Nov 15, 2022Updated 3 years ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆136Nov 17, 2025Updated 2 months ago
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- This tool (enhance_long) aims to enhance the LlaMa2 long-context extrapolation capability at the lowest cost, preferably without …☆45Nov 30, 2023Updated 2 years ago
- ☆111Jan 8, 2025Updated last year
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Sep 9, 2024Updated last year
- A Survey of Task-Oriented Knowledge Graph Reasoning: Status, Applications, and Prospects☆72Jul 11, 2025Updated 7 months ago
- The Next Step Forward in Multimodal LLM Alignment☆197May 1, 2025Updated 9 months ago
- Public code repository to reproduce our MICCAI 2022 paper: "Automatic identification of segmentation errors for radiotherapy using geomet…☆11Dec 8, 2022Updated 3 years ago
- [CVPR 2025] Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding☆15Jun 16, 2025Updated 8 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆26Feb 4, 2026Updated last week
- Efficient Segment Anything in Medical Images☆42Jul 27, 2024Updated last year
- EHR datasets preprocessing scripts☆11Jan 31, 2024Updated 2 years ago
- ☆10Nov 29, 2022Updated 3 years ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆11Updated this week
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago