autotunafish / offline_sstLinks
repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commons CC0. https://creativecommons.org/share-your-work/public-domain/cc0/
β19Updated 6 months ago
Alternatives and similar repositories for offline_sst
Users that are interested in offline_sst are comparing it to the libraries listed below
Sorting:
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4β24Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ161Updated last year
- Hebrew grapheme to phoneme (G2P)β79Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervisionβ29Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"β103Updated 5 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β28Updated 2 years ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNetβ38Updated 11 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ105Updated 5 months ago
- Clone a voice in a few seconds to generate arbitrary speech in real-time in multiple languagesβ54Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime portβ¦β26Updated 3 months ago
- Using OpenVINO to speed up MeloTTS inferenceβ15Updated last year
- Zero-shot Audio Classification using Whisperβ79Updated 3 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β67Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning β¦β26Updated 2 years ago
- Speaker diarization serviceβ25Updated 5 months ago
- an improved version of Real-time-voice-cloningβ52Updated last year
- On-device noise suppression powered by deep learningβ77Updated last week
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)β29Updated last month
- Free Dutch voice datasetβ13Updated 4 years ago
- a simple system for 2-way interruptible voice interactions between human and LLMβ30Updated last year
- Finally, some decent sample sentencesβ23Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcriptionβ23Updated last week
- β31Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.β13Updated last year
- A testing repo to share code and thoughts on diarisationβ57Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedbackβ¦β10Updated 2 months ago
- β100Updated last year
- Create an LJSpeech structured voice dataset on wave inputβ36Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to pβ¦β52Updated 3 years ago
- OpenAI Whisper for edge devicesβ132Updated 2 years ago