curryqka / AgentThinkLinks
[EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!
☆136Updated 4 months ago
Alternatives and similar repositories for AgentThink
Users that are interested in AgentThink are comparing it to the libraries listed below
Sorting:
- ☆93Updated 7 months ago
- ☆223Updated 3 months ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving☆166Updated last week
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Updated 2 months ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆135Updated last week
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆178Updated 3 weeks ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆272Updated 3 months ago
- ☆321Updated 3 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆380Updated last month
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆130Updated 4 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆482Updated 3 weeks ago
- A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts.☆64Updated 2 months ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆65Updated last year
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆75Updated 6 months ago
- A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨☆27Updated 9 months ago
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆216Updated last month
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆270Updated 3 months ago
- ☆55Updated 2 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 3 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆232Updated 3 months ago
- your finance bro Agent for trading and investing☆108Updated 3 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated 2 months ago
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆67Updated 2 months ago
- WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving☆171Updated last week
- 🐾 PawHaven — An open-source, enterprise-ready full-stack project powered by React, NestJS, and pnpm, featuring a Monorepo architecture t…☆87Updated this week
- CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation☆17Updated 8 months ago
- ☆104Updated 4 months ago
- A lightweight React component that renders its children only on the client side, helping avoid SSR hydration errors in frameworks like Ne…☆31Updated 2 months ago
- ☆117Updated 5 months ago