OminousIndustries / PhoneDriver
Android Phone Control With Qwen3-VL
☆100 · Updated 2 weeks ago
Alternatives and similar repositories for PhoneDriver
Users who are interested in PhoneDriver are comparing it to the repositories listed below.
- ACE-Step: A Step Towards Music Generation Foundation Model ☆45 · Updated 5 months ago
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco… ☆215 · Updated 3 months ago
- VLLM Port of the Chatterbox TTS model ☆325 · Updated 3 weeks ago
- Run Ollama LLM models in Google Colab for free ☆36 · Updated 11 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆124 · Updated 2 months ago
- Fast local speech-to-text for any app using faster-whisper ☆143 · Updated last month
- ☆190 · Updated 7 months ago
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob… ☆208 · Updated 3 months ago
- OLLama IMage CAtegorizer ☆70 · Updated 9 months ago
- Personal voice assistant, with voice interruption and Twilio support ☆18 · Updated 8 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆96 · Updated 4 months ago
- ☆173 · Updated 2 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di… ☆110 · Updated 2 weeks ago
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser ☆110 · Updated 4 months ago
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language. ☆55 · Updated last month
- Agent MCP for ffmpeg ☆209 · Updated 5 months ago
- ☆54 · Updated 5 months ago
- ☆91 · Updated 5 months ago
- The GPT-4o image generation we have at home. A powerful, self-hosted AI photo stylizer built for performance and privacy. ☆485 · Updated 4 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control. ☆27 · Updated 6 months ago
- ☆300 · Updated 3 months ago
- The PyVisionAI Official Repo ☆104 · Updated 3 months ago
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to … ☆156 · Updated 2 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great. ☆39 · Updated 8 months ago
- A web application that converts speech to speech 100% private ☆77 · Updated 5 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools ☆112 · Updated 4 months ago
- LLM search engine faster than perplexity! ☆364 · Updated 2 months ago
- Examples for using Hyperbrowser ☆143 · Updated last month
- 🕵️‍♂️ All-in-one OSINT tool for analysing any website ☆23 · Updated 7 months ago
- Orpheus Chat WebUI ☆75 · Updated 7 months ago