MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
☆142Oct 10, 2025Updated 7 months ago
Alternatives and similar repositories for MLLM-Tool
Users that are interested in MLLM-Tool are comparing it to the libraries listed below.
- [WACV 2024] TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding☆13May 30, 2024Updated last year
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆51Aug 31, 2021Updated 4 years ago
- ☆30Jun 19, 2024Updated last year
- [ACM MM 2024] GeoFormer: Learning Point Cloud Completion with Tri-Plane Integrated Transformer☆35Nov 26, 2024Updated last year
- The codes for RFNet: Recurrent Forward Network for Dense Point Cloud Completion☆20Jan 17, 2022Updated 4 years ago
- [ICLR 2025] 3D StreetUnveiler with Semantic-aware 2DGS - a simple baseline☆126Jun 16, 2025Updated 11 months ago
- [TPAMI 2024] DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction☆122Feb 26, 2026Updated 3 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆71Aug 5, 2025Updated 9 months ago
- Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation (ECCV2024)☆14Nov 1, 2024Updated last year
- [ECCV 2022] The official repo for the paper "UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation".☆71Aug 11, 2023Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆30Feb 4, 2026Updated 3 months ago
- CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM☆261Sep 16, 2025Updated 8 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆68Mar 22, 2026Updated 2 months ago
- The codes for ECCV'22: Resolution-free Point Cloud Sampling Network with Data Distillation☆18Feb 16, 2023Updated 3 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated last year
- [ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"☆38Mar 11, 2026Updated 2 months ago
- [ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction☆240Aug 4, 2025Updated 9 months ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆36Aug 20, 2025Updated 9 months ago
- ☆25May 13, 2024Updated 2 years ago
- ☆80Jul 3, 2024Updated last year
- ☆491Sep 25, 2024Updated last year
- Official implementation of "Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation", ECCV2024☆20Nov 12, 2024Updated last year
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 8 months ago
- ☆53Sep 30, 2024Updated last year
- The implementation of MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis☆42Jul 15, 2024Updated last year
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆45Sep 12, 2024Updated last year
- ☆15Jan 9, 2026Updated 4 months ago
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆27May 23, 2024Updated 2 years ago
- ☆49Nov 19, 2023Updated 2 years ago
- ☆69Sep 15, 2025Updated 8 months ago
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)☆326Jan 20, 2025Updated last year
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors [ICLR 2025]☆140Apr 16, 2025Updated last year
- Awesome-BEV-Perception☆32Jun 27, 2023Updated 2 years ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Jul 29, 2023Updated 2 years ago
- [TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases☆16Jul 10, 2025Updated 10 months ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆767Feb 1, 2024Updated 2 years ago
- Official code for AAAI2023 paper `Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆47Feb 9, 2025Updated last year