OminousIndustries / PhoneDriverLinks
Android Phone Control With Qwen3-VL
☆114Updated last month
Alternatives and similar repositories for PhoneDriver
Users that are interested in PhoneDriver are comparing it to the libraries listed below
Sorting:
- Fast local speech-to-text for any app using faster-whisper☆146Updated 3 months ago
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆228Updated 4 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆238Updated 3 weeks ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆46Updated 7 months ago
- Local Reasoning Agent using LangChain + Ollama☆108Updated last month
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 10 months ago
- Run Ollama LLM models in Google Colab for free☆37Updated last year
- AI debugger and AI coder integrated. Use AI to code and drives runtime debugger☆75Updated 3 weeks ago
- A web application that converts speech to speech 100% private☆81Updated 6 months ago
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco…☆221Updated 4 months ago
- The Open Framework for autonomous virtual computer agents at scale, fully open-source, safe, auditable, and production-ready.☆278Updated 2 months ago
- Capture, tag, and search images locally with OSS models.☆44Updated 11 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆50Updated 5 months ago
- Allows two LLMs to communicate and run code in the terminal☆27Updated last year
- Free ComfyUI Workflows☆40Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 7 months ago
- Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.☆56Updated 2 weeks ago
- OLLama IMage CAtegorizer☆70Updated 11 months ago
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 7 months ago
- the AI IDE for work, research, development, and play.☆213Updated last week
- This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models loc…☆125Updated last week
- Retrieval-augmented generation (RAG) for remote & local LLM use☆46Updated 6 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆125Updated 3 months ago
- the multi-agent shell for building agent organizations☆142Updated this week
- Open Source Local Data Analysis Assistant.☆144Updated 2 months ago
- Examples for using Hyperbrowser☆156Updated 2 weeks ago
- Create 3D files in the CLI with Small Language Model☆43Updated 2 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆112Updated 5 months ago
- Web-Navigator is an agent for web browsing and scraping websites.☆177Updated 3 months ago
- VLLM Port of the Chatterbox TTS model☆351Updated 2 months ago