[Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search
☆102Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for VisuoThink
Users that are interested in VisuoThink are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 📚 TG-EDU综合教育平台 | 支持作业提交📝、批量评分✅、补交申请🔄、团队协作👥、成绩统计📊☆110Mar 24, 2026Updated 3 weeks ago
- Flutter × Riverpod × LangChain で構築した LINE 風 UI の ChatGPT クライアント。Markdown レンダリングやストリーミング応答に対応し、モバイル/デスクトップ両対応のクロスプラットフォーム LLM フロントエンド☆33Oct 18, 2025Updated 5 months ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆18Jul 22, 2025Updated 8 months ago
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated 10 months ago
- The official implementation of our NeurIPS 2025 Poster paper: Precise Diffusion Inversion: Towards Novel Samples and Few-Step Model☆90Nov 30, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Course Website for OS Autumn 2021 at Fudan University☆14Feb 1, 2022Updated 4 years ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆48Jul 22, 2025Updated 8 months ago
- This is the repository for the paper ‘A Survey of Inductive Reasoning for Large Language Models’☆46Apr 8, 2026Updated last week
- This project frames the zoning problem as a mixed-integer linear program (MILP) defined over a spatial grid of planning units.☆81Oct 29, 2025Updated 5 months ago
- [NeurIPS 2025] Codes for paper Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation …☆133Sep 20, 2025Updated 6 months ago
- OpenThinkIMG is an end-to-end open-source framework that empowers Large Vision-Language Models to think with images.☆120Jul 11, 2025Updated 9 months ago
- A dataset for fall detection using photorealistic virtual environments.☆48Feb 6, 2025Updated last year
- Repository of IPBench☆20Apr 6, 2026Updated last week
- A programming language version manager 🚀 🚀☆301Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for "Filling MIDI Velocity using U-Net Image Colorizer" (CMMR2025) PyTorch implementation for filling MIDI velocities from given MID…☆39Dec 1, 2025Updated 4 months ago
- 3D generation made easy!☆448Apr 6, 2026Updated last week
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆756Updated this week
- An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability.☆37Oct 6, 2025Updated 6 months ago
- Course Website for OS 2020 Fall at Fudan University☆21May 31, 2021Updated 4 years ago
- ☆133Mar 22, 2025Updated last year
- BERT-based AI-generated academic text detection model☆222Mar 31, 2026Updated 2 weeks ago
- ☆45Jan 26, 2026Updated 2 months ago
- 复旦大学绩点&给分查询工具☆25Sep 24, 2011Updated 14 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆82Aug 19, 2025Updated 7 months ago
- ☆27Dec 30, 2025Updated 3 months ago
- Efflux desktop service☆388Jul 17, 2025Updated 8 months ago
- 为您的网站或APP接入USDT收款,无需区块链知识,支持Telegram和独角数卡,支持回调,支持各种编程语言,整个过程只需2步,小白也能接入。a USDT wallet development and automatic payment APIs☆216Updated this week
- 🔥minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,minerproxy,矿池抽水,矿池中转,矿场运维专用☆3,412Apr 2, 2026Updated last week
- ☆1,000Sep 10, 2025Updated 7 months ago
- [NAACL 2025] SIUO: Cross-Modality Safety Alignment☆124Jan 31, 2025Updated last year
- This is a official repository for MExD☆19Oct 27, 2025Updated 5 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆1,043Dec 8, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Align Anything: Training All-modality Model with Feedback☆4,646Nov 27, 2025Updated 4 months ago
- Chat with Arxiv Paper 📑 (ChatGPT) / 通过对话理解论文☆27Apr 6, 2023Updated 3 years ago
- ☆40Dec 7, 2025Updated 4 months ago
- [NeurIPS'24] Free Lunch in Pathology Foundation Model: Task-specific Model Adaptation with Concept-Guided Feature Enhancement - NeurIPS 2…☆15Nov 19, 2024Updated last year
- MCPCAN is a centralized management platform for MCP services. It deploys each MCP service using a container deployment method. The platfo…☆716Apr 3, 2026Updated last week
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆587Apr 1, 2026Updated last week
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,165Dec 15, 2025Updated 4 months ago