☆30Apr 16, 2024Updated 2 years ago
Alternatives and similar repositories for assistgui
Users that are interested in assistgui are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆124Jul 27, 2025Updated 10 months ago
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆36Jan 2, 2024Updated 2 years ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆39Nov 11, 2025Updated 7 months ago
- ☆20Apr 24, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆33Sep 27, 2024Updated last year
- Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"☆12Apr 4, 2022Updated 4 years ago
- ☆13Jun 14, 2023Updated 3 years ago
- [ICLR 2026] - One2Scene☆45May 25, 2026Updated 3 weeks ago
- 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.☆1,193Aug 17, 2025Updated 10 months ago
- GUICourse: From General Vision Langauge Models to Versatile GUI Agents☆142Mar 1, 2026Updated 3 months ago
- Under construction☆13Jan 15, 2025Updated last year
- Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆40Sep 28, 2025Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆232Jun 16, 2025Updated last year
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆66Feb 22, 2026Updated 3 months ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- ☆46Mar 19, 2024Updated 2 years ago
- Pythonic wrappers for Cider/CiderD evaluation metrics. Provides CIDEr as well as CIDEr-D (CIDEr Defended) which is more robust to gaming …☆13Dec 4, 2025Updated 6 months ago
- ☆18Nov 1, 2024Updated last year
- ☆12Aug 8, 2024Updated last year
- Official implementation of "MedITok: A Unified Tokenizer for Medical Image Synthesis and Interpretation"☆29Apr 3, 2026Updated 2 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆41Apr 9, 2026Updated 2 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆161Feb 11, 2025Updated last year
- This project explores the different techniques (both scalable and non scalable) for Graph based semi supervised learning. Recent techniqu…☆14May 28, 2016Updated 10 years ago
- Event-Driven BackTesting Framework☆16Aug 22, 2018Updated 7 years ago
- [NeurIPS 2024] Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators☆16Nov 15, 2024Updated last year
- ☆20Aug 30, 2023Updated 2 years ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- code for Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection☆19Mar 4, 2024Updated 2 years ago
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆50Oct 1, 2025Updated 8 months ago
- FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)☆24Sep 24, 2023Updated 2 years ago
- ☆14Jul 25, 2025Updated 10 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,861Apr 24, 2026Updated last month
- ☆232Apr 21, 2026Updated last month
- Send and receive doginals☆13Mar 17, 2023Updated 3 years ago
- DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆32Mar 8, 2026Updated 3 months ago