showlab / ShowUI-AlohaLinks
Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.
☆108Updated last week
Alternatives and similar repositories for ShowUI-Aloha
Users that are interested in ShowUI-Aloha are comparing it to the libraries listed below
Sorting:
- Reasoning Systems with tool use are strong zero-shot object detectors☆60Updated 3 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆129Updated 5 months ago
- ☆33Updated last year
- A powerful AI agent for browser-based interactions powered by Fireworks AI models. Navigate the web, extract content, analyze websites, a…☆46Updated 7 months ago
- ☆21Updated last year
- ACE-Step: A Step Towards Music Generation Foundation Model☆47Updated 7 months ago
- Run Ollama LLM models in Google Colab for free☆37Updated last year
- An OpenSource Deep Research library with reasoning☆169Updated last month
- ☆28Updated 8 months ago
- ☆19Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Updated 2 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated last year
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆72Updated 4 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated last year
- Daily.co + Pipecat + Tavus AI Avatar Agent☆15Updated 9 months ago
- Jockey is a conversational video agent.☆97Updated 7 months ago
- ☆18Updated 4 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆126Updated 4 months ago
- An AI focused photo manipulation tool based on Gradio☆183Updated 6 months ago
- The official GitHub Page for MiniMax☆60Updated 2 months ago
- An advanced Digital Worker framework for AI agent-driven research and process automations.☆46Updated 10 months ago
- ☆107Updated 2 months ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆224Updated 3 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆222Updated 4 months ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆27Updated 2 weeks ago
- This is an MCP (Model Context Protocol) Server for discovering and downloading 3D models☆28Updated 10 months ago
- Free ComfyUI Workflows☆43Updated 2 weeks ago
- Allows two LLMs to communicate and run code in the terminal☆28Updated last year
- Service for testing out the new Qwen2.5 omni model☆62Updated 8 months ago