Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.
☆242 · Jan 20, 2026 · Updated last month
Alternatives and similar repositories for ShowUI-Aloha
Users who are interested in ShowUI-Aloha are comparing it to the repositories listed below
- ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands ☆97 · Feb 6, 2026 · Updated last month
- ☆14 · Jul 17, 2025 · Updated 7 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning" ☆49 · Jan 30, 2026 · Updated last month
- Under construction ☆13 · Jan 15, 2025 · Updated last year
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation ☆86 · Aug 18, 2025 · Updated 6 months ago
- Scaling Zero-Shot Reference-to-Video Generation ☆63 · Dec 11, 2025 · Updated 2 months ago
- Bypass browser bot detection in langchain tools ☆17 · Feb 10, 2026 · Updated 3 weeks ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ☆52 · Feb 23, 2026 · Updated last week
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files ☆41 · Feb 9, 2026 · Updated 3 weeks ago
- ☆27 · May 30, 2025 · Updated 9 months ago
- ☆26 · Jan 28, 2026 · Updated last month
- ☆28 · Jun 5, 2025 · Updated 9 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆49 · Updated this week
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning" ☆30 · Jan 12, 2026 · Updated last month
- ☆27 · Jun 18, 2025 · Updated 8 months ago
- 👻 kwami.io | A 3D Interactive AI Companion Library for creating engaging AI companions with visual (blob), audio, and AI speech capabili… ☆42 · Feb 20, 2026 · Updated 2 weeks ago
- ☆44 · Sep 3, 2025 · Updated 6 months ago
- TUI monitor for OpenClaw sub-agents and more ☆62 · Feb 21, 2026 · Updated 2 weeks ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit ☆107 · Feb 21, 2026 · Updated last week
- End2End Virtual Try-on with Visual Reference, CVPR2026 ☆58 · Nov 19, 2025 · Updated 3 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars" ☆41 · Nov 20, 2025 · Updated 3 months ago
- [CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models ☆77 · Feb 26, 2026 · Updated last week
- AI Agent - It can book rides, order food, post tweets and also control basic tasks on device. ☆36 · Aug 24, 2025 · Updated 6 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026] ☆72 · Jan 15, 2026 · Updated last month
- MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents ☆250 · Feb 22, 2026 · Updated last week
- ☆86 · Feb 4, 2026 · Updated last month
- GenFilesMCP: Minimal MCP Server for Open Web UI. Generates PPTX, XLSX, DOCX or MD files using user requests and full chat context. *Pul… ☆69 · Feb 19, 2026 · Updated 2 weeks ago
- Official PyTorch Implementation of Ctrl-Crash 💥 ☆51 · Jun 3, 2025 · Updated 9 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ☆172 · Feb 4, 2026 · Updated last month
- Code release for AccDiffusionV2 (TPAMI) ☆35 · Nov 4, 2025 · Updated 4 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 · Updated this week
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆12 · Jun 28, 2025 · Updated 8 months ago
- [EMNLP 2025] Verification Engineering for RL in Instruction Following ☆51 · Jan 5, 2026 · Updated 2 months ago
- ☆40 · Jul 15, 2025 · Updated 7 months ago
- [NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any ima… ☆749 · Sep 24, 2025 · Updated 5 months ago
- MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs. ☆30 · Jul 20, 2025 · Updated 7 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆45 · Jun 24, 2025 · Updated 8 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe… ☆126 · Jan 29, 2026 · Updated last month
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing" ☆60 · Updated this week