[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".
☆25 Jun 16, 2025 Updated last year
Alternatives and similar repositories for NoteMR
Users that are interested in NoteMR are comparing it to the libraries listed below.
- rmp data ranking ☆13 Nov 4, 2025 Updated 7 months ago
- Using image captions with LLM for zero-shot VQA ☆19 Mar 14, 2024 Updated 2 years ago
- ☆15 Apr 25, 2025 Updated last year
- ☆19 Jul 3, 2025 Updated 11 months ago
- ☆39 May 28, 2025 Updated last year
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding ☆30 Dec 18, 2025 Updated 6 months ago
- A mini-app to solve the heat conduction equation ☆15 Jul 1, 2020 Updated 5 years ago
- I know Kung Fu ☆25 Mar 27, 2025 Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision ☆46 Dec 2, 2025 Updated 6 months ago
- Capstone project thesis ☆10 Jul 27, 2013 Updated 12 years ago
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement ☆39 Sep 10, 2025 Updated 9 months ago
- Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024). ☆15 Jan 9, 2025 Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement ☆17 Nov 11, 2024 Updated last year
- ☆10 Apr 16, 2024 Updated 2 years ago
- adapt data to and from every format ☆28 Apr 27, 2026 Updated 2 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs ☆25 May 7, 2025 Updated last year
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes" ☆31 Feb 24, 2025 Updated last year
- ☆19 Aug 15, 2018 Updated 7 years ago
- A Monte Carlo Neutron Transport Mini-App ☆15 Apr 15, 2019 Updated 7 years ago
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined ☆56 Sep 25, 2025 Updated 9 months ago
- [ICLR 2025] COME: Test-time Adaption by Conservatively Minimizing Entropy ☆23 Mar 5, 2025 Updated last year
- TopicGPT integrates the benefits of LLMs into topic modelling ☆28 Jun 22, 2024 Updated 2 years ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models" ☆17 Sep 12, 2024 Updated last year
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21. ☆22 Jul 16, 2021 Updated 4 years ago
- Source-free Domain Generalization ☆16 Sep 24, 2024 Updated last year
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing ☆41 Jan 12, 2026 Updated 5 months ago
- 🚀 Beautiful React Native UI library ☆16 Dec 26, 2025 Updated 6 months ago
- This is a simplified demo for the paper: Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval ☆15 Sep 13, 2020 Updated 5 years ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction ☆149 Mar 6, 2025 Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆32 Jun 26, 2024 Updated 2 years ago
- Code for Goal-Aware Prediction: Learning to Model what Matters ☆20 Jul 15, 2020 Updated 5 years ago
- ☆14 Oct 16, 2023 Updated 2 years ago
- ☆34 Mar 28, 2025 Updated last year
- ☆32 May 17, 2024 Updated 2 years ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models ☆114 Feb 16, 2025 Updated last year
- [CVPR 2025] Open-World Amodal Appearance Completion ☆56 Nov 10, 2025 Updated 7 months ago
- High Performance Grouped GEMM in PyTorch ☆30 May 10, 2022 Updated 4 years ago
- Retrospective on non-employee (OD) management and thoughts on improving interviews ☆16 Jul 2, 2025 Updated 11 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆22 Feb 7, 2024 Updated 2 years ago