autotunafish / offline_sstLinks
repo of files pertaining to realtime, offline translations using whisper realtime and argos translate. This repo is marked Creative Commons CC0. https://creativecommons.org/share-your-work/public-domain/cc0/
☆19Updated 5 months ago
Alternatives and similar repositories for offline_sst
Users that are interested in offline_sst are comparing it to the libraries listed below
Sorting:
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 4 months ago
- Indic-Conformer models for ASR☆18Updated last year
- Ready-to-use Multilingual Text-To-Speech (TTS) package.☆24Updated 2 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆22Updated 2 months ago
- A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning☆12Updated last year
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆24Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- ☆18Updated 3 years ago
- Normalize Text in Russian☆28Updated last year
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆73Updated 2 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated 2 years ago
- Russian accentuator and IPA transcriber☆15Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆28Updated last week
- Speaker diarization service☆24Updated 4 months ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Using OpenVINO to speed up MeloTTS inference☆13Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆254Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆53Updated 2 years ago
- Python text-to-speech library with built-in voice effects and support for multiple TTS engines☆25Updated 7 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- OpenAI Whisper for edge devices☆130Updated 2 years ago
- Free Dutch voice dataset☆13Updated 4 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆25Updated 2 months ago
- Whisper from OpenAi and diarization with Pyannote☆49Updated last year
- Dippy Synthetic Speech Subnet☆17Updated last month