[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".
☆21Jun 16, 2025Updated 9 months ago
Alternatives and similar repositories for NoteMR
Users that are interested in NoteMR are comparing it to the libraries listed below
Sorting:
- Code for "Dual-Level Adaptive Incongruity-Enhanced Model for Multimodal Sarcasm Detection".☆30Mar 20, 2025Updated last year
- ☆10Oct 2, 2017Updated 8 years ago
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- 基于bert中文多分类模型☆11Mar 23, 2019Updated 6 years ago
- 通过手势识别控制播放器☆10Mar 13, 2020Updated 6 years ago
- 基于Bert、Pytorch的中文短文本分类任务☆13Nov 2, 2022Updated 3 years ago
- Wave Function Collapse x Stable Diffusion, tile map generation with diffusion algorithm☆24Jul 3, 2023Updated 2 years ago
- ☆14Apr 25, 2025Updated 10 months ago
- This technical demo is an open-source project that allows users to customize the appearance and design of the map in game with stable dif…☆25Apr 19, 2023Updated 2 years ago
- ☆18Jul 3, 2025Updated 8 months ago
- ☆37May 28, 2025Updated 9 months ago
- [NeurIPS 2025] Official website for code and models of Time Series RAG (TS-RAG)☆49Mar 1, 2026Updated 2 weeks ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆28Dec 18, 2025Updated 3 months ago
- A mini-app to solve the heat conduction equation☆15Jul 1, 2020Updated 5 years ago
- I know Kung Fu☆24Mar 27, 2025Updated 11 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆38Dec 2, 2025Updated 3 months ago
- Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).☆14Jan 9, 2025Updated last year
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 6 months ago
- Image Caption Generator implemented using Tensorflow and Keras in a Python Jupyter Notebook. The goal is to describe the content of an im…☆32Feb 17, 2021Updated 5 years ago
- bert实现中文NER☆30Aug 10, 2022Updated 3 years ago
- Official code for "Enabling Uncertainty Estimation in Iterative Neural Networks" (ICML 2024)☆19Jul 8, 2024Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- ☆10Apr 16, 2024Updated last year
- adapt data to and from every format☆28Feb 15, 2026Updated last month
- mit6.830 all-pass☆12Mar 25, 2022Updated 3 years ago
- ☆26May 13, 2025Updated 10 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆22May 7, 2025Updated 10 months ago
- FD-MVLLM: Fault Diagnosis Based on Multimodal Vibration Data and Large Language Model for Bearing☆63Jan 21, 2026Updated 2 months ago
- Official implementation of Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement.☆42Dec 21, 2023Updated 2 years ago
- Official implementation of SGDiff (ACM MM '23)☆37Nov 26, 2023Updated 2 years ago
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes"☆31Feb 24, 2025Updated last year
- ☆19Aug 15, 2018Updated 7 years ago
- [ICLR 2025] COME: Test-time Adaption by Conservatively Minimizing Entropy☆18Mar 5, 2025Updated last year
- Portable auto-vectorizable n-body benchmark☆20Feb 25, 2026Updated 3 weeks ago
- Implements LLM-Lasso☆39Jul 28, 2025Updated 7 months ago
- A Monte Carlo Neutron Transport Mini-App☆15Apr 15, 2019Updated 6 years ago
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined☆48Sep 25, 2025Updated 5 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.☆22Jul 16, 2021Updated 4 years ago