Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.
☆277Jan 20, 2026Updated 2 months ago
Alternatives and similar repositories for ShowUI-Aloha
Users that are interested in ShowUI-Aloha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Jul 17, 2025Updated 9 months ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆56Feb 23, 2026Updated last month
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆64Feb 28, 2026Updated last month
- Professional Markdown editor with Mermaid diagrams & KaTeX formulas. Zero-config, pure static, export to PDF/HTML. Perfect for technical …☆37Jan 14, 2026Updated 3 months ago
- Under construction☆13Jan 15, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents☆32Jun 3, 2025Updated 10 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆69Dec 11, 2025Updated 4 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆55Apr 3, 2026Updated 2 weeks ago
- A cross-platform GPU monitor TUI with support for both Apple Silicon and NVIDIA GPUs.☆85Mar 5, 2026Updated last month
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆42Mar 24, 2026Updated 3 weeks ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆174Feb 4, 2026Updated 2 months ago
- Official PyTorch Implementation of Ctrl-Crash 💥☆52Jun 3, 2025Updated 10 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆572Nov 19, 2025Updated 5 months ago
- ☆27May 30, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- TUI monitor for OpenClaw sub-agents and more☆71Mar 13, 2026Updated last month
- Small Image Processor - Ultra memory-efficient image processing for Cloudflare Workers 🟠☆119Apr 9, 2026Updated last week
- [NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any ima…☆785Sep 24, 2025Updated 6 months ago
- A set of tools to create synthetically-generated data from documents☆46Aug 15, 2025Updated 8 months ago
- Swift wrapper for the Bitcoin Core RPC☆14Dec 27, 2021Updated 4 years ago
- ☆191Jul 31, 2025Updated 8 months ago
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 3 years ago
- DoubleAI’s hyperoptimised version of cuGraph☆50Mar 3, 2026Updated last month
- [CVPR 2026] Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training".☆229Apr 9, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL 2026] Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration☆21Apr 11, 2026Updated last week
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 4 months ago
- An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆452Dec 2, 2025Updated 4 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆31Feb 10, 2026Updated 2 months ago
- ☆33Sep 19, 2025Updated 7 months ago
- ☆78Dec 23, 2025Updated 3 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆301Jul 15, 2025Updated 9 months ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆116Jul 27, 2025Updated 8 months ago
- ☆33Jun 18, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆29Oct 8, 2025Updated 6 months ago
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files☆44Feb 9, 2026Updated 2 months ago
- Hands-On Tutorial on Building Multimodal RAG Systems☆13Apr 10, 2025Updated last year
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆74Mar 26, 2026Updated 3 weeks ago
- Bypass browser bot detection in langchain tools☆18Feb 10, 2026Updated 2 months ago
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents☆407Mar 31, 2026Updated 2 weeks ago
- SimpleMem: Efficient Lifelong Memory for LLM Agents — Text & Multimodal