MrAliHasan / Sophia-AI-AssistantLinks
Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, browsing websites, and making calls via phone or WhatsApp. It uses the Hugging Face API for responses and offers activation via voice, text input, or a keyboard shortcut.
☆21Updated 11 months ago
Alternatives and similar repositories for Sophia-AI-Assistant
Users that are interested in Sophia-AI-Assistant are comparing it to the libraries listed below
Sorting:
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆14Updated 4 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Updated 7 months ago
- specifications and documentation for the Open Voice Interoperability Initiative Project☆19Updated 3 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆29Updated 3 months ago
- ☆16Updated 3 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆12Updated 11 months ago
- A true Artificial Intelligent Assistant with ALICE as backend and offline speech recognition with vosk engine and pyttsx3 as text to spee…☆86Updated last year
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆54Updated 2 years ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆28Updated 3 months ago
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆60Updated 4 months ago
- AI Search engine☆12Updated last week
- Open TTS models, built for streaming on the edge☆43Updated 6 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆22Updated 10 months ago
- Text To Speech Multilingual Support (+20 Language)☆50Updated 2 years ago
- An ai music website developed based on Next.js and Suno AI.☆15Updated 9 months ago
- JARVIS AGI || AI Powered Voice Assistant with Real Human Capabilities☆206Updated 9 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆36Updated 8 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆19Updated 3 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 5 months ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆17Updated last year
- Ai generated music video with Riffusion and Gradio☆22Updated 2 years ago
- Self-hosted AI voice agent☆115Updated last year
- Outbound Phone GPT is a sophisticated prototype for a context-aware agent designed to autonomously handle outbound phone calls.☆17Updated last year
- The objective of the Speaking Portal Project is to design, develop, and deploy a lip-sync animation API for the Kukarella text-to-speech …☆12Updated 2 years ago
- WebSage is an AI Engine that extracts content from any URL, generates summaries, and enables interaction using AI models. Choose between …☆16Updated 7 months ago
- Translated vocal synthesis - Clone a voice and output speech in another language☆26Updated 3 years ago
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.☆19Updated last year