[CVPR 2025] Code for "Notes-guided MLLM Reasoning: Enhancing MLLM with Knowledge and Visual Notes for Visual Question Answering".
☆25Jun 16, 2025Updated 11 months ago
Alternatives and similar repositories for NoteMR
Users that are interested in NoteMR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- rmp data ranking☆13Nov 4, 2025Updated 6 months ago
- Using image captions with LLM for zero-shot VQA☆19Mar 14, 2024Updated 2 years ago
- ☆14Apr 25, 2025Updated last year
- ☆38May 28, 2025Updated 11 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆29Dec 18, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆42Dec 2, 2025Updated 5 months ago
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆39Sep 10, 2025Updated 8 months ago
- Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).☆15Jan 9, 2025Updated last year
- ☆11Dec 13, 2023Updated 2 years ago
- ☆10Apr 16, 2024Updated 2 years ago
- adapt data to and from every format☆28Apr 27, 2026Updated 3 weeks ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆24May 7, 2025Updated last year
- ☆13Oct 4, 2023Updated 2 years ago
- Portable auto-vectorizable n-body benchmark☆21Feb 25, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined☆53Sep 25, 2025Updated 7 months ago
- TopicGPT allows to integrate the benefits of LLMs into Topic Modelling☆28Jun 22, 2024Updated last year
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆17Sep 12, 2024Updated last year
- ☆21Oct 9, 2025Updated 7 months ago
- ☆11Feb 3, 2024Updated 2 years ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 7 months ago
- 🚀 Beautiful React Native UI library☆16Dec 26, 2025Updated 4 months ago
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆145Mar 6, 2025Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆32Jun 26, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Goal-Aware Prediction: Learning to Model what Matters☆20Jul 15, 2020Updated 5 years ago
- A C++based implementation of the TeaLeaf heat conduction mini-app. This implementation of TeaLeaf replicates the functionality of the ref…☆25Aug 11, 2024Updated last year
- ☆34Mar 28, 2025Updated last year
- ☆11Sep 30, 2024Updated last year
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆109Feb 16, 2025Updated last year
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 4 years ago
- 非雇员OD管理复盘与面试改进思考☆16Jul 2, 2025Updated 10 months ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆96Nov 29, 2025Updated 5 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [MICCAI 2024 workshop] Official implementation of "SemiT-SAM: Building a Visual Foundation Model for Tooth Instance Segmentation on Panor…☆16Nov 13, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- AskYP is an open-source AI chatbot that uses OpenAI Functions and the Vercel AI SDK to interact with the Yelp Fusion API with natural lan…☆18Aug 27, 2023Updated 2 years ago
- TaiYiXLCheckpointLoader: An unoffical node support Taiyi-Diffusion-XL(Taiyi-XL) Chinese-English bilingual language model☆11Sep 1, 2024Updated last year
- ☆13Apr 28, 2025Updated last year
- ☆29Dec 12, 2023Updated 2 years ago
- [ICML2025] Test-Time Learning for Large Language Models☆56Jan 31, 2026Updated 3 months ago