MrAliHasan / Sophia-AI-Assistant
Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, browsing websites, and making calls via phone or WhatsApp. It uses the Hugging Face API for responses and offers activation via voice, text input, or a keyboard shortcut.
☆13Updated 4 months ago
Alternatives and similar repositories for Sophia-AI-Assistant:
Users that are interested in Sophia-AI-Assistant are comparing it to the libraries listed below
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- a simple system for 2-way interruptible voice interactions between human and LLM☆22Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆10Updated 2 months ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆16Updated last week
- StyleTTS 2 Optimized Training Fork☆22Updated 2 weeks ago
- ☆9Updated 4 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- ☆10Updated 3 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 10 months ago
- A lightweight Python library for running TTS models with a unified API.☆16Updated this week
- ☆11Updated 3 months ago
- ☆12Updated 6 months ago
- Real-time end-to-end singing voice convertion☆19Updated 3 months ago
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.☆19Updated 9 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆18Updated last month
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆19Updated 3 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆14Updated 4 months ago
- ☆12Updated 2 years ago
- Text To Speech Multilingual Support (+20 Language)☆41Updated last year
- ☆14Updated last year
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated 7 months ago
- Russian phonetical transcription☆9Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆29Updated 9 months ago
- ☆11Updated 9 years ago
- Heteronym to Phoneme Parser☆18Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 7 months ago