[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".
☆20 · Jun 16, 2025 · Updated 8 months ago
Alternatives and similar repositories for NoteMR
Users who are interested in NoteMR are comparing it to the libraries listed below
- rmp data ranking ☆13 · Nov 4, 2025 · Updated 3 months ago
- ☆37 · May 28, 2025 · Updated 9 months ago
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement ☆39 · Sep 10, 2025 · Updated 5 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding ☆26 · Dec 18, 2025 · Updated 2 months ago
- Code for "Dual-Level Adaptive Incongruity-Enhanced Model for Multimodal Sarcasm Detection". ☆28 · Mar 20, 2025 · Updated 11 months ago
- I know Kung Fu ☆22 · Mar 27, 2025 · Updated 11 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆33 · Dec 2, 2025 · Updated 2 months ago
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes" ☆31 · Feb 24, 2025 · Updated last year
- adapt data to and from every format ☆28 · Feb 15, 2026 · Updated last week
- TopicGPT integrates the benefits of LLMs into topic modelling ☆28 · Jun 22, 2024 · Updated last year
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined ☆45 · Sep 25, 2025 · Updated 5 months ago
- An experiment to see if we can process G2 reviews to extract topics from reviews ☆10 · Feb 5, 2024 · Updated 2 years ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing ☆40 · Jan 12, 2026 · Updated last month
- [CVPR 2025] Open-World Amodal Appearance Completion ☆51 · Nov 10, 2025 · Updated 3 months ago
- High Security Surveillance Camera using OpenCV, Python & Arduino ☆12 · Jun 20, 2020 · Updated 5 years ago
- ☆11 · Sep 30, 2024 · Updated last year
- Script that parses Interactive Brokers trade reports to help fill in Finnish tax reports ☆13 · Jan 10, 2024 · Updated 2 years ago
- Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrati… ☆17 · Feb 20, 2026 · Updated last week
- Control a media player through gesture recognition ☆11 · Mar 13, 2020 · Updated 5 years ago
- ☆10 · Oct 2, 2017 · Updated 8 years ago
- A cog implementation of mPLUG-Owl🦉, a multimodal large language model ☆11 · May 12, 2023 · Updated 2 years ago
- TaiYiXLCheckpointLoader: An unofficial node supporting the Taiyi-Diffusion-XL (Taiyi-XL) Chinese-English bilingual language model ☆11 · Sep 1, 2024 · Updated last year
- Object detection and classification ☆12 · Oct 19, 2018 · Updated 7 years ago
- A python library that supports all vector databases specifically for LLM apps and frameworks ☆13 · May 3, 2023 · Updated 2 years ago
- ☆18 · Jul 3, 2025 · Updated 7 months ago
- Machine Learning and Deep Learning with examples. ☆10 · Feb 26, 2019 · Updated 7 years ago
- Retrospective on non-employee OD management and reflections on improving interviews ☆16 · Jul 2, 2025 · Updated 7 months ago
- mit6.830 all-pass ☆12 · Mar 25, 2022 · Updated 3 years ago
- A cross-platform ZeroTier desktop client. Build with Tauri, Rust, Vite, React, Zustand, Next UI and Tailwind CSS ☆10 · Oct 24, 2025 · Updated 4 months ago
- C++ PyTorch Examples ☆10 · Aug 18, 2019 · Updated 6 years ago
- Prompt Free, Soul Driven AI Assistant ☆28 · Feb 19, 2026 · Updated last week
- Concurrent TikTok video downloader without watermark. (Snaptik) ☆13 · Dec 16, 2023 · Updated 2 years ago
- 🚀 Beautiful React Native UI library ☆15 · Dec 26, 2025 · Updated 2 months ago
- ☆14 · Oct 11, 2024 · Updated last year
- Exploration of World Languages ☆19 · Apr 5, 2024 · Updated last year
- Building instance segmentation using Mask RCNN ☆11 · Nov 15, 2017 · Updated 8 years ago
- ☆13 · Nov 25, 2022 · Updated 3 years ago
- ☆14 · Apr 25, 2025 · Updated 10 months ago
- A mini-app to solve the heat conduction equation ☆15 · Jul 1, 2020 · Updated 5 years ago