curryqka / AgentThinkLinks
[EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!
☆135Updated 3 months ago
Alternatives and similar repositories for AgentThink
Users that are interested in AgentThink are comparing it to the libraries listed below
Sorting:
- ☆92Updated 6 months ago
- ☆224Updated 2 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆269Updated 2 months ago
- WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving☆98Updated last month
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding☆58Updated 2 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆128Updated 3 months ago
- A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨☆27Updated 8 months ago
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding☆76Updated 5 months ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆128Updated 2 weeks ago
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆174Updated this week
- ☆314Updated 3 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts.☆64Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆350Updated last month
- ☆55Updated last month
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆195Updated 2 weeks ago
- ☆104Updated 3 months ago
- ☆18Updated 6 months ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 2 months ago
- 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future☆237Updated this week
- 🐾 PawHaven — An open-source, enterprise-ready full-stack project powered by React, NestJS, and pnpm, featuring a Monorepo architecture t…☆86Updated this week
- A lightweight React component that renders its children only on the client side, helping avoid SSR hydration errors in frameworks like Ne…☆31Updated 2 months ago
- your finance bro Agent for trading and investing☆107Updated 2 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆267Updated 2 months ago
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆67Updated 2 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆232Updated 3 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆472Updated last month
- CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects☆38Updated 2 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆199Updated 3 years ago
- switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…☆170Updated 2 months ago