Awesome papers for multi-modal LLMs with grounding ability
☆19Oct 11, 2025Updated 4 months ago
Alternatives and similar repositories for awesome-mllm-grounding
Users that are interested in awesome-mllm-grounding are comparing it to the libraries listed below
- MLLM @ Game☆16May 12, 2025Updated 9 months ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆34Feb 25, 2026Updated last week
- Detectron2 Toolbox and Benchmark for V3Det☆18Jun 2, 2024Updated last year
- ☆49Apr 11, 2025Updated 10 months ago
- image retrieval using metric learning☆10Nov 22, 2022Updated 3 years ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆294Nov 18, 2025Updated 3 months ago
- The official PyTorch code for "Uncertainty-aware Contrastive Distillation for Incremental Semantic Segmentation"☆34Mar 26, 2022Updated 3 years ago
- An official implementation of "ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation" (NeurIPS 2022)…☆49Dec 19, 2022Updated 3 years ago
- ☆37May 28, 2022Updated 3 years ago
- ☆25Aug 19, 2025Updated 6 months ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated last month
- GRPO training for long-form QA and instructions with a long-form reward model☆17Jul 17, 2025Updated 7 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection☆10Mar 20, 2014Updated 11 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- A scalable data preprocessing framework built on PySpark for LLM training☆23Dec 9, 2025Updated 3 months ago
- Neural radiance fields: paper study notes☆10Sep 25, 2021Updated 4 years ago
- [AAAI 2026] TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution☆14Aug 1, 2025Updated 7 months ago
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated last month
- ☆13Jun 22, 2022Updated 3 years ago
- a feature frontend for VINS☆10Aug 27, 2018Updated 7 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 8 months ago
- My templates used in OI. All C++.☆11Jul 17, 2018Updated 7 years ago
- Central difference kalman filter which can work with states on a manifold☆12Feb 26, 2021Updated 5 years ago
- ☆11Feb 28, 2024Updated 2 years ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- The code for On Robust Cross-View Consistency in Outdoor Self-Supervised Monocular Depth Estimation☆13Jun 2, 2023Updated 2 years ago
- ☆11Sep 15, 2016Updated 9 years ago
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models☆17Nov 4, 2025Updated 4 months ago
- ☆56Mar 6, 2025Updated last year
- This repository contains a curated list of papers, code, and other resources related to the automatic colorization of images using deep l…☆15Jul 20, 2023Updated 2 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- 5 point;ekf;gazebo;g2o;loop closure;☆12Dec 23, 2015Updated 10 years ago
- Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆33Dec 9, 2025Updated 3 months ago
- Loop Closure Detector☆13Feb 2, 2018Updated 8 years ago
- Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)☆16Jan 7, 2025Updated last year