OminousIndustries / PhoneDriver
Android Phone Control With Qwen3-VL
☆124 Updated 3 months ago
Alternatives and similar repositories for PhoneDriver
Users who are interested in PhoneDriver are comparing it to the repositories listed below.
- Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops. ☆177 Updated 3 weeks ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker. ☆29 Updated last week
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation ☆130 Updated 5 months ago
- A real-time shared memory layer for multi-agent LLM systems. ☆53 Updated last month
- The Open Framework for autonomous virtual computer agents at scale, fully open-source, safe, auditable, and production-ready. ☆309 Updated last week
- Dashboard v5 Coming Soon!! ☆64 Updated last month
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes. ☆254 Updated last week
- Fast local speech-to-text for any app using faster-whisper ☆147 Updated last week
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o… ☆227 Updated 3 months ago
- Your own personal AIGC Factory. Any picture. Any reel. The Comfy way. ©️ ☆116 Updated this week
- ACE-Step: A Step Towards Music Generation Foundation Model ☆50 Updated 8 months ago
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat… ☆286 Updated this week
- Create 3D files in the CLI with Small Language Model ☆43 Updated 3 months ago
- Run Ollama LLM models in Google Colab for free ☆37 Updated last year
- Free ComfyUI Workflows ☆60 Updated last week
- Chain apps and models to build robust AI workflows 🤗 ☆424 Updated last week
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function ☆25 Updated 9 months ago
- Memory that learns what works. ☆109 Updated 2 weeks ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob… ☆234 Updated 6 months ago
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco… ☆228 Updated 6 months ago
- VLLM Port of the Chatterbox TTS model ☆365 Updated 3 months ago
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web … ☆136 Updated last month
- Retrieval-augmented generation (RAG) for remote & local LLM use ☆44 Updated 8 months ago
- Service for testing out the new Qwen2.5 omni model ☆63 Updated 9 months ago
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI. ☆342 Updated last month
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language. ☆57 Updated last month
- ☆19 Updated 7 months ago
- Daily.co + Pipecat + Tavus AI Avatar Agent ☆15 Updated 9 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆137 Updated 2 weeks ago
- A cross-platform desktop application for running AI models from [WaveSpeedAI](https://wavespeed.ai), as well as many free local AI models… ☆97 Updated last week