AI-Application-and-Integration-Lab / OMTSegLinks
[ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
☆16Updated last month
Alternatives and similar repositories for OMTSeg
Users that are interested in OMTSeg are comparing it to the libraries listed below
Sorting:
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation☆17Updated last month
- Scene-Text-Detection-And-Recognition-Model_M504☆25Updated 10 months ago
- ☆13Updated 2 years ago
- ☆13Updated last year
- ☆39Updated 6 months ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆52Updated last year
- ☆17Updated 2 months ago
- A distributed training framework for large language models powered by Lightning.☆22Updated 3 months ago
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM.☆50Updated 2 weeks ago
- ☆26Updated 8 months ago
- A Survey on Multimodal Retrieval-Augmented Generation☆231Updated 3 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆402Updated this week
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆204Updated 2 weeks ago
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…☆12Updated last year
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Updated 9 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆217Updated 2 months ago
- The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"☆14Updated last year
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆41Updated 5 months ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆9Updated last year
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆85Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆46Updated last month
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆137Updated 2 years ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆143Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.☆453Updated this week
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆83Updated 11 months ago
- ☆136Updated last year
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆803Updated 8 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆286Updated 8 months ago
- Latest Advances on Modality Priors in Multimodal Large Language Models☆20Updated last month
- ☆14Updated last year