☆32Jul 3, 2025Updated 10 months ago
Alternatives and similar repositories for UI-Vision
Users that are interested in UI-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GroundCUA☆126Mar 24, 2026Updated last month
- ☆13Nov 5, 2024Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆25Updated this week
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 11 months ago
- Lab tasks for the course on "Data Engineering for Machine Learning"☆10May 1, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆164Nov 6, 2025Updated 6 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆38Apr 23, 2026Updated 3 weeks ago
- [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos☆52Feb 22, 2026Updated 2 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆110Sep 8, 2025Updated 8 months ago
- A customizable lightweight Grad-CAM implementation☆16Nov 30, 2019Updated 6 years ago
- MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…☆144May 7, 2026Updated 2 weeks ago
- ReproZip for the Preservation of Web Applications☆17May 6, 2024Updated 2 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Jun 5, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30Mar 18, 2026Updated 2 months ago
- ☆33Jan 28, 2026Updated 3 months ago
- ☆13Nov 25, 2023Updated 2 years ago
- [ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation".☆18Sep 27, 2024Updated last year
- AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management☆30Apr 10, 2026Updated last month
- Edit and Generate Anything in 3D world!☆13Apr 15, 2023Updated 3 years ago
- ☆17Nov 26, 2024Updated last year
- An isolated environment for DNS cache poisoning attack investigation and demonstration.☆10Nov 22, 2020Updated 5 years ago
- [TIP2024] Official implementation of the paper ‘Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspecti…☆18Oct 1, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code for AIM2022 compressed image super-resolution☆16Apr 27, 2023Updated 3 years ago
- ☆14Oct 8, 2025Updated 7 months ago
- Translation and understanding of the Pop-art paper.☆18Oct 21, 2019Updated 6 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 3 months ago
- A demo showing off daily-bots realtime voice and convex☆11Feb 5, 2026Updated 3 months ago
- ☆34Sep 19, 2025Updated 8 months ago
- 深度学习领域论文翻译+理解☆18Feb 25, 2022Updated 4 years ago
- ☆12Jul 16, 2024Updated last year
- ☆20Apr 24, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Mar 2, 2026Updated 2 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- Some simple codes to format the CSDMC2010 SPAM corpus☆11Sep 18, 2016Updated 9 years ago
- LLM as World Models using Bayesian inference☆18May 27, 2025Updated 11 months ago
- Assessing Knee OA Severity with CNN attention-based end-to-end architectures☆20Jun 26, 2019Updated 6 years ago
- This is the project page for the HOSNeRF☆16Dec 11, 2023Updated 2 years ago
- Dual-Branch Network for Portrait Image Quality Assessment☆18Sep 16, 2025Updated 8 months ago