MrAliHasan / Sophia-AI-Assistant
Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, browsing websites, and making calls via phone or WhatsApp. It uses the Hugging Face API for responses and offers activation via voice, text input, or a keyboard shortcut.
☆16Updated 6 months ago
Alternatives and similar repositories for Sophia-AI-Assistant:
Users that are interested in Sophia-AI-Assistant are comparing it to the libraries listed below
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆18Updated 2 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- StyleTTS 2 Optimized Training Fork☆27Updated 2 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 7 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆12Updated 4 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 9 months ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆9Updated 6 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Translated vocal synthesis - Clone a voice and output speech in another language☆25Updated 2 years ago
- ☆10Updated this week
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆39Updated 4 months ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- Text To Speech Multilingual Support (+20 Language)☆43Updated last year
- The Vokan Architecture (Tsukasa speech based)☆9Updated 2 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆14Updated last month
- Real-time end-to-end singing voice convertion☆21Updated 5 months ago
- Russian phonetical transcription☆10Updated last year
- Voice assistant with audio input and audio output using Whisper and Eleven Labs☆11Updated this week
- ☆14Updated 9 months ago
- ☆13Updated 8 months ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆10Updated 2 months ago
- A ChatGPT based Computer Assistant☆10Updated last year
- create dataset from list of youtube links easily☆17Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆16Updated 7 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 5 months ago
- ☆11Updated 2 years ago
- ☆11Updated 9 years ago