prateekralhan / OpenAI_Whisper_ASR
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
☆66Updated 2 years ago
Alternatives and similar repositories for OpenAI_Whisper_ASR:
Users that are interested in OpenAI_Whisper_ASR are comparing it to the libraries listed below
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆56Updated this week
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using …☆28Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- ☆56Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆25Updated 4 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- Text To Speech Multilingual Support (+20 Language)☆41Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆43Updated this week
- A simple voice conversion tool☆17Updated 2 years ago
- ☆36Updated 4 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆38Updated 2 years ago
- ☆19Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Putting flows on top of neural transducers for better TTS☆61Updated last week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆92Updated 4 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago
- ☆20Updated 2 years ago
- OpenAI Whisper Prompt Examples☆50Updated last year
- Speaker diarization service☆21Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- VoiceBox neural network implementation☆101Updated 6 months ago
- Official Code for ParrotTTS☆49Updated 4 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 2 months ago
- Text-To-Speech for NotebookLM☆29Updated last month