PCIResearch / TransCore-MLinks
Large Multimodal Model
☆15Updated last year
Alternatives and similar repositories for TransCore-M
Users that are interested in TransCore-M are comparing it to the libraries listed below
Sorting:
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆48Updated last year
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆33Updated 6 months ago
- ☆22Updated last year
- ChineseCLIP using online learning☆13Updated 3 years ago
- ☆19Updated 2 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆20Updated last year
- ☆87Updated last year
- ☆18Updated 3 years ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Updated 3 years ago
- ☆72Updated 2 years ago
- [ACM MM2025] The official repository for the RealSyn dataset☆38Updated 5 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆106Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Updated 4 months ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆41Updated 2 years ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆64Updated last year
- ☆16Updated 8 months ago
- ☆123Updated last year
- Toward Universal Multimodal Embedding☆68Updated 4 months ago
- ☆73Updated 6 months ago
- ☆91Updated 2 years ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆20Updated 2 months ago
- A huge dataset for Document Visual Question Answering☆20Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- Large-batch Optimization for Dense Visual Predictions (NeurIPS 2022)☆57Updated 3 years ago
- official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"☆38Updated 5 months ago
- ☆21Updated last year
- Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021☆58Updated 3 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆20Updated 9 months ago
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Updated last year