PCIResearch / TransCore-MLinks
Large Multimodal Model
☆15Updated last year
Alternatives and similar repositories for TransCore-M
Users that are interested in TransCore-M are comparing it to the libraries listed below
Sorting:
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆35Updated 6 months ago
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆48Updated last year
- ☆23Updated last year
- ☆19Updated 2 years ago
- ChineseCLIP using online learning☆13Updated 3 years ago
- ☆18Updated 3 years ago
- A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).☆20Updated last year
- Toward Universal Multimodal Embedding☆72Updated 4 months ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Updated last year
- ☆72Updated 2 years ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Updated 5 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆65Updated last year
- The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.☆26Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆108Updated last year
- ☆124Updated last year
- ☆87Updated last year
- ☆74Updated 7 months ago
- [ACM MM2025] The official repository for the RealSyn dataset☆39Updated last week
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated last year
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Updated 3 years ago
- Large-batch Optimization for Dense Visual Predictions (NeurIPS 2022)☆57Updated 3 years ago
- Facebook Image Similarity Challenge 2021☆19Updated 4 years ago
- ☆30Updated last year
- ☆91Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆18Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated 2 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆41Updated 2 years ago
- Our 2nd-gen LMM☆34Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆131Updated 5 months ago