☆32Jul 3, 2025Updated 9 months ago
Alternatives and similar repositories for UI-Vision
Users that are interested in UI-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GroundCUA☆117Mar 24, 2026Updated 2 weeks ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 3 months ago
- [AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…☆88Dec 1, 2025Updated 4 months ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆25Jun 17, 2025Updated 9 months ago
- ☆119Apr 8, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- Benchmark of complex, multimodal desktop-oriented tasks for advanced GUI-navigation AI agents☆24May 7, 2025Updated 11 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Feb 25, 2026Updated last month
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆103Sep 8, 2025Updated 7 months ago
- 《MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation》☆142Feb 2, 2026Updated 2 months ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆115Jul 27, 2025Updated 8 months ago
- Union-set Multi-source Model Adaptation for Semantic Segmentation☆12Oct 24, 2022Updated 3 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Jun 5, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆29Mar 18, 2026Updated 3 weeks ago
- ☆30Jan 28, 2026Updated 2 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- ☆12Nov 25, 2023Updated 2 years ago
- AI-Generated Video Detection via Perceptual Straightening (NeurIPS2025)☆34Jan 2, 2026Updated 3 months ago
- AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management☆26Apr 3, 2026Updated last week
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- ☆17Nov 26, 2024Updated last year
- [TIP2024] Official implementation of the paper ‘Perception-Distortion Balanced Super-Resolution: A Multi-Objective Optimization Perspecti…☆17Oct 1, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames☆11Jun 1, 2025Updated 10 months ago
- The official implementation of our ICCV 2023 publication, C-VisDiT☆10Oct 23, 2024Updated last year
- 📕 skills that help you connect to rednote (xiaohongshu)☆40Feb 26, 2026Updated last month
- ☆14Oct 8, 2025Updated 6 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- A demo showing off daily-bots realtime voice and convex☆11Feb 5, 2026Updated 2 months ago
- ☆33Sep 19, 2025Updated 6 months ago
- 深度学习领域论文翻译+理解☆17Feb 25, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Jul 16, 2024Updated last year
- ☆20Apr 24, 2024Updated last year
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- Some simple codes to format the CSDMC2010 SPAM corpus☆11Sep 18, 2016Updated 9 years ago
- ☆10Aug 7, 2023Updated 2 years ago
- LLM as World Models using Bayesian inference☆17May 27, 2025Updated 10 months ago
- This is the project page for the HOSNeRF☆16Dec 11, 2023Updated 2 years ago