prateekralhan / OpenAI_Whisper_ASR
A minimalistic Streamlit-based web app for automatic speech recognition, powered by OpenAI's state-of-the-art Whisper models
☆65 Updated 2 years ago
Related projects
Alternatives and complementary repositories for OpenAI_Whisper_ASR
- PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆47 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆44 Updated 4 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art zero-shot multi-speaker functionality. A web API built with the… ☆46 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆98 Updated last year
- Create training data for training a voice cloner for Bark text-to-speech. ☆44 Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ☆129 Updated last year
- Generate audio datasets for training text-to-speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Coqui STT Model Manager: install, manage and try out Coqui STT models from the Model Zoo ☆24 Updated last year
- An espeak-compatible, permissively licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆83 Updated last month
- VALL-E 2 reproduction ☆83 Updated 3 months ago
- VoiceBox neural network implementation ☆96 Updated 3 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆133 Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 Updated 3 months ago
- Adaptive Vocoder for Custom Voice ☆58 Updated 2 years ago
- Text-to-speech is an emerging area of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆27 Updated last year
- Your one-stop solution for voice dataset creation ☆112 Updated 10 months ago
- Easy tool that splits given audio based on speaker. ☆11 Updated 10 months ago
- An unofficial PyTorch implementation of VALL-E ☆75 Updated this week
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 Updated 3 months ago
- A simple voice conversion tool ☆15 Updated 2 years ago
- Finetuning VITS Efficiently ☆32 Updated last year