Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.
☆309Jan 20, 2026Updated 4 months ago
Alternatives and similar repositories for ShowUI-Aloha
Users that are interested in ShowUI-Aloha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands☆128Apr 22, 2026Updated last month
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆58Feb 23, 2026Updated 3 months ago
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆65Feb 28, 2026Updated 3 months ago
- AI Agent - It can book rides, order food, post tweets and also control basic tasks on device.☆38Aug 24, 2025Updated 9 months ago
- The agent runtime built to operate.☆138May 25, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆43Mar 24, 2026Updated 2 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆68Apr 3, 2026Updated 2 months ago
- Rogue your vibe hero like rogue like.☆28Nov 8, 2025Updated 7 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆178Feb 4, 2026Updated 4 months ago
- Official PyTorch Implementation of Ctrl-Crash 💥☆53Jun 3, 2025Updated last year
- This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models☆22Apr 27, 2024Updated 2 years ago
- A multilingual tool to convert PDF ebooks to audiobooks using XTTS v2 TTS model by cloning a speaker voice.☆18Jan 22, 2025Updated last year
- ☆31May 30, 2025Updated last year
- End2End Virtual Try-on with Visual Reference, CVPR2026☆68Apr 18, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 3 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 7 months ago
- [NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any ima…☆805Sep 24, 2025Updated 8 months ago
- 🚀 Free and open-source landing page template built with Next.js and Shadcn UI.☆20Jun 10, 2026Updated last week
- [CVPR 2026] FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning☆59Mar 26, 2026Updated 2 months ago
- Edit and Generate Anything in 3D world!☆13Apr 15, 2023Updated 3 years ago
- ☆192Jul 31, 2025Updated 10 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆87Mar 3, 2026Updated 3 months ago
- [ACL 2026] Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration☆24Apr 11, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆120May 17, 2026Updated last month
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 6 months ago
- [CVPR 2026] Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training".☆262Apr 17, 2026Updated 2 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆449Dec 2, 2025Updated 6 months ago
- Python-based automated 2D animation tool that generates videos from text scripts and audio files. Uses AI for text analysis, lip sync, an…☆32Oct 13, 2025Updated 8 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆35Jun 7, 2026Updated last week
- A fully client-side chat application with AI capabilities running entirely in your browser. No servers, complete privacy, and persistent …☆15Mar 14, 2025Updated last year
- [CVPR 2026] FrankenMotion: Part-level Human Motion Generation and Composition☆245May 13, 2026Updated last month
- Media server with remote control via telegram☆17Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆124Jul 27, 2025Updated 10 months ago
- [CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding☆49Feb 28, 2026Updated 3 months ago
- ☆11Sep 19, 2025Updated 9 months ago
- ☆89Dec 23, 2025Updated 5 months ago
- ☆31Oct 8, 2025Updated 8 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆82May 2, 2026Updated last month
- ☆460Dec 8, 2025Updated 6 months ago