OminousIndustries / PhoneDriverLinks
Android Phone Control With Qwen3-VL
☆123Updated 3 months ago
Alternatives and similar repositories for PhoneDriver
Users that are interested in PhoneDriver are comparing it to the libraries listed below
Sorting:
- Fast local speech-to-text for any app using faster-whisper☆146Updated 4 months ago
- Human-taught Computer-use Agent Designed for Real Windows and MacOS Desktops.☆160Updated last week
- Dashboard v5 Coming Soon!!☆63Updated 3 weeks ago
- The Open Framework for autonomous virtual computer agents at scale, fully open-source, safe, auditable, and production-ready.☆301Updated last month
- A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automat…☆142Updated this week
- Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiob…☆233Updated 5 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆131Updated 2 weeks ago
- AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…☆225Updated 3 months ago
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆249Updated last week
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆130Updated 5 months ago
- Chain apps and models to build robust AI workflows 🤗☆203Updated this week
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆269Updated last month
- ACE-Step: A Step Towards Music Generation Foundation Model☆47Updated 8 months ago
- A cross-platform desktop application for running AI models from [WaveSpeedAI](https://wavespeed.ai), as well as many free local AI models…☆93Updated this week
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆60Updated 2 months ago
- Local modular AI assistant with speech, vision, and robotics support. Uses Qwen3-VL-4B in LM Studio.☆51Updated 3 weeks ago
- Create 3D files in the CLI with Small Language Model☆43Updated 3 months ago
- VLLM Port of the Chatterbox TTS model☆364Updated 3 months ago
- BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to …☆246Updated 5 months ago
- Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web …☆124Updated last month
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 8 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 3 weeks ago
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Updated 2 months ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B☆565Updated 2 months ago
- A web application that converts speech to speech 100% private☆82Updated 7 months ago
- Test your local LLMs on the AIME problems☆31Updated 7 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆382Updated last week
- ☆178Updated 5 months ago
- Memory that learns what works.☆109Updated this week
- Local AI voice assistant stack for Home Assistant (GPU-accelerated) with persistent memory, follow-up conversation, and Ollama model reco…☆227Updated 6 months ago