MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning
☆40May 7, 2026Updated last month
Alternatives and similar repositories for MindWatcher
Users who are interested in MindWatcher are comparing it to the libraries listed below.
- Detecting and Evaluating Medical Hallucinations in Large Vision Language Models☆12Jun 24, 2024Updated last year
- [ICANN 2024 (Oral)] MISS: A Generative Pre-training and Fine-tuning Approach for Med-VQA☆12Aug 8, 2024Updated last year
- Code for "Exploring Grounding Potential of VQA-oriented GPT-4V for Zero-shot Anomaly Detection"☆31Nov 7, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".☆71Feb 28, 2024Updated 2 years ago
- [MICCAI 2024] Can LLMs' Tuning Methods Work in Medical Multimodal Domain?☆17Sep 18, 2024Updated last year
- FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION(ICASSP2020)☆10May 7, 2020Updated 6 years ago
- ☆11Jun 27, 2022Updated 3 years ago
- [Preprint] "CERL: Coordinated Enhancement for Real-World Low-Light Noisy Images" by Zeyuan Chen, Yifan Jiang, Dong Liu and Zhangyang Wang☆15Feb 24, 2022Updated 4 years ago
- daily learning☆10Jul 10, 2022Updated 3 years ago
- C++ implementation of a super-resolution (SR) paper needed for a project: Image Super-Resolution using Gradient Profile Prior☆12Jul 24, 2019Updated 6 years ago
- [AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…☆113Dec 1, 2025Updated 6 months ago
- pytorch version of code completion with neural attention and pointer networks☆12Jan 17, 2020Updated 6 years ago
- ☆14Nov 4, 2021Updated 4 years ago
- (Unstructured) Weight Pruning via Adaptive Sparsity Loss☆15Sep 28, 2022Updated 3 years ago
- Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks☆15Feb 17, 2025Updated last year
- ☆17Jul 10, 2022Updated 3 years ago
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- Fast CNN Stereo Depth Estimation through Embedded GPU Device☆18Nov 22, 2022Updated 3 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 3 years ago
- ☆12Jan 10, 2025Updated last year
- A digital twin of the city of Chicago along with automated sensors☆13Nov 14, 2019Updated 6 years ago
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated last year
- D-LSD: a Distorted Line Segment Detector for Calibrated Images☆18May 19, 2021Updated 5 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2021 -- Network Pruning using Adaptive Exemplar Filters☆24Apr 4, 2021Updated 5 years ago
- [ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark☆158May 4, 2026Updated last month
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year
- PyTorch reimplementation of PPYOLOv2, PPYOLO, YOLOv3☆14Jul 17, 2021Updated 4 years ago
- Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes (ICCV2021)☆20Mar 19, 2024Updated 2 years ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- (WIP) long-form speech generation☆31Apr 2, 2025Updated last year
- ☆28Jun 12, 2025Updated last year
- CycleGAN adaptation for day-to-night domain transfer of driving-related scenes.☆14Apr 9, 2019Updated 7 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 — Carrying out CNN Channel Pruning in a White Box☆18Feb 15, 2022Updated 4 years ago
- Interpretations of cutting-edge papers in machine learning, deep learning, computer vision, and related fields.☆10Dec 4, 2021Updated 4 years ago
- Official Pytorch implementation for the paper "Single Stage Class Agnostic Common Object Detection"☆17Nov 17, 2020Updated 5 years ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆91Nov 27, 2025Updated 6 months ago
- Custom ComfyUI node that combines VSR + VFI and allows streaming processing for arbitrary video length.☆66Mar 28, 2026Updated 2 months ago
- Use MCL in Mirai Console to manage packages and other advanced features☆10Nov 13, 2022Updated 3 years ago