MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning
☆40May 7, 2026Updated 3 weeks ago
Alternatives and similar repositories for MindWatcher
Users that are interested in MindWatcher are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Detecting and Evaluating Medical Hallucinations in Large Vision Language Models☆12Jun 24, 2024Updated last year
- Code for "Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection"☆31Nov 7, 2023Updated 2 years ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- [MICCAI 2024] Can LLMs' Tuning Methods Work in Medical Multimodal Domain?☆17Sep 18, 2024Updated last year
- ☆11Jun 27, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- dairly learning☆10Jul 10, 2022Updated 3 years ago
- [AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…☆110Dec 1, 2025Updated 5 months ago
- LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀☆15Jul 12, 2021Updated 4 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- Fast CNN Stereo Depth Estimation through Embedded GPU Device☆18Nov 22, 2022Updated 3 years ago
- Rank-consistent Oridinal Regression☆17Dec 24, 2019Updated 6 years ago
- ☆10Apr 19, 2022Updated 4 years ago
- [ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem☆22Apr 7, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated last year
- A digital twin of the city of Chicago along with automated sensors☆13Nov 14, 2019Updated 6 years ago
- D-LSD: a Distorted Line Segment Detector for Calibrated Images☆18May 19, 2021Updated 5 years ago
- NightSurveillance Sataset for Pedestrian Detection☆11Jul 30, 2020Updated 5 years ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆70Jan 23, 2026Updated 4 months ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2021 -- Network Pruning using Adaptive Exemplar Filters☆24Apr 4, 2021Updated 5 years ago
- [ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark☆140May 4, 2026Updated 3 weeks ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year
- ☆29Apr 7, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- MXNet-Gluon model to Caffe (support SSD in gluoncv)☆10Jun 20, 2019Updated 6 years ago
- This code implements the code of the paper ,"Neighbor2Neighbor:Self-Supervised Denoising From Single Noisy Images",in2021☆16Feb 26, 2021Updated 5 years ago
- SiamAtt: Siamese attention network for visual tracking☆15Apr 29, 2021Updated 5 years ago
- ☆28Jun 12, 2025Updated 11 months ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 — Carrying out CNN Channel Pruning in a White Box☆18Feb 15, 2022Updated 4 years ago
- De novo analysis for cryo-electron tomography☆25Nov 8, 2024Updated last year
- PyTorch implementation of NeurIPS 2020 paper "Pruning Filter in Filter".☆18Jan 4, 2021Updated 5 years ago
- Official Pytorch implementation for the paper "Single Stage Class Agnostic Common Object Detection"☆17Nov 17, 2020Updated 5 years ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆89Nov 27, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆65Mar 28, 2026Updated 2 months ago
- 在 Mirai Console 中使用MCL管理包和其他高级功能☆10Nov 13, 2022Updated 3 years ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated 3 weeks ago
- "FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track☆12Jun 20, 2024Updated last year
- Better Than Reference In Low Light Image Enhancement: Conditional Re-Enhancement Networks☆17Jan 14, 2025Updated last year
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision☆45Oct 19, 2025Updated 7 months ago
- Official repo for "TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders"☆25Apr 9, 2026Updated last month