GroundCUA
☆68Dec 24, 2025Updated 2 months ago
Alternatives and similar repositories for GroundCUA
Users that are interested in GroundCUA are comparing it to the libraries listed below
Sorting:
- ☆30Jul 3, 2025Updated 7 months ago
- ☆25Jan 28, 2026Updated last month
- 📚 A collection of resources and papers on Large Language Models in autonomous driving☆27Oct 30, 2023Updated 2 years ago
- R1-like Computer-use Agent☆89Mar 21, 2025Updated 11 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆136Nov 19, 2025Updated 3 months ago
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆52Feb 21, 2025Updated last year
- Collect the awesome works evolved around reasoning models like O1/R1 in visual domain☆53Jul 21, 2025Updated 7 months ago
- a implementation of vibe with python☆11Jul 27, 2018Updated 7 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- 2023年秋季北航编译原理课程☆13Dec 25, 2024Updated last year
- 用AI从0开始制作“研究生模拟器”小游戏☆42Jan 20, 2026Updated last month
- ☆23Feb 12, 2026Updated 2 weeks ago
- Open-source clone of OpenAI's Deep Research. Works with any transformer, gpt4free, & runs in browser. No Firecrawl needed.☆12Jun 12, 2025Updated 8 months ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆12Dec 31, 2024Updated last year
- ☆17Oct 30, 2023Updated 2 years ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated last year
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆13Feb 12, 2024Updated 2 years ago
- Progressive Language-guided Visual Learning for Multi-Task Visual Grounding☆13May 9, 2025Updated 9 months ago
- In this project, facial recognition algorithm is implemented with python using PCA and SVD dimensionality reduction tools.☆10Sep 2, 2019Updated 6 years ago
- mouse pet-ct image segmentation☆12Feb 19, 2023Updated 3 years ago
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 9 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- ☆28Jan 5, 2026Updated last month
- Collections of Papers and Projects for Multimodal Reasoning.☆107Apr 25, 2025Updated 10 months ago
- An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanj…☆48Nov 13, 2024Updated last year
- Implementation of SimMOD: A Simple Baseline for Multi-Camera 3D Object Detection☆48Jan 17, 2023Updated 3 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated 11 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 7 months ago
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆13May 3, 2024Updated last year
- This repository including most of cnn visualizations techniques using pytorch☆14Apr 14, 2020Updated 5 years ago
- Python package for downloading and formatting the UK's Road Safety Data.☆16Aug 3, 2024Updated last year
- A browser based CadQuery server☆12Feb 18, 2025Updated last year
- [NeurIPS 2024] Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators☆16Nov 15, 2024Updated last year
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆24Aug 8, 2025Updated 6 months ago
- ☆12Oct 10, 2023Updated 2 years ago
- Under construction☆13Jan 15, 2025Updated last year
- Generate a 3D BIM Model from 2D CAD Drawings☆12Nov 23, 2022Updated 3 years ago
- Explore Go Awesome Standard library one module at the time☆15Jan 9, 2018Updated 8 years ago