curryqka / AgentThink
[EMNLP2025] Official implementation: Agent-style visual question answering in autonomous driving!
☆134 · Updated 3 months ago
Alternatives and similar repositories for AgentThink
Users who are interested in AgentThink are comparing it to the libraries listed below.
- ☆94 · Updated 5 months ago
- ☆224 · Updated last month
- Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding ☆57 · Updated last month
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems ☆75 · Updated last week
- A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts. ☆62 · Updated 3 weeks ago
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin… ☆192 · Updated last week
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World ☆164 · Updated 2 weeks ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction ☆128 · Updated 2 months ago
- A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨ ☆27 · Updated 7 months ago
- DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving ☆137 · Updated 3 weeks ago
- ☆55 · Updated last month
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving ☆265 · Updated 2 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation" ☆230 · Updated 2 months ago
- your finance bro Agent for trading and investing ☆105 · Updated last month
- 🐾 PawHaven — An open-source, enterprise-ready full-stack project powered by React, NestJS, and pnpm, featuring a Monorepo architecture t… ☆86 · Updated this week
- ☆18 · Updated 5 months ago
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation. ☆67 · Updated last month
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning" ☆200 · Updated 3 years ago
- ☆104 · Updated 2 months ago
- ☆303 · Updated 2 months ago
- A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT). ☆105 · Updated 2 months ago
- [IROS2025] OpenGS-Fusion: Open-Vocabulary Dense Mapping with Hybrid 3D Gaussian Splatting for Refined Object-Level Understanding ☆75 · Updated 5 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm… ☆376 · Updated 2 weeks ago
- 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future ☆167 · Updated last week
- switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co… ☆169 · Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models ☆462 · Updated 2 weeks ago
- A lightweight React component that renders its children only on the client side, helping avoid SSR hydration errors in frameworks like Ne… ☆31 · Updated last month
- UR2: Unify RAG and Reasoning through Reinforcement Learning ☆125 · Updated last month
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling ☆65 · Updated last year
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b… ☆47 · Updated 7 months ago