gongouveia / Whisper-Synthetic-ASR-Dataset-GeneratorLinks
This UI serves as a Synthetic ASR Dataset Generator powered by/for OpenAI Whisper, enabling users to capture audio, transcribing it, on the fly and manage the generated dataset š¤. Fine tune Whisper or enhanced and custom datasets
ā29Updated 6 months ago
Alternatives and similar repositories for Whisper-Synthetic-ASR-Dataset-Generator
Users that are interested in Whisper-Synthetic-ASR-Dataset-Generator are comparing it to the libraries listed below
Sorting:
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.ā62Updated last week
- Simple PyTorch Denoisers for Waveform Audioā35Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā98Updated 8 months ago
- ā30Updated 2 years ago
- ā103Updated last week
- Speaker diarization serviceā23Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.ā137Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.ā35Updated 2 years ago
- A testing repo to share code and thoughts on diarisationā55Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus filesā64Updated last month
- ā86Updated 8 months ago
- IPA Phonemizer/Dephonemizer for 139 human languagesā27Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.ā83Updated last year
- Audiogen Codecā137Updated 11 months ago
- Repository contains code to fine-tune WhisperASR modelā23Updated 2 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpusā14Updated 4 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.ā31Updated 2 years ago
- ā36Updated last month
- Tools to create your own voice dataset for TTS trainingā66Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformerā51Updated 2 weeks ago
- Create training data for training a voice cloner for bark text to speech.ā45Updated last year
- ONNX Inference of Pyannote Segmentationā90Updated 5 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based ā¦ā136Updated 3 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsā66Updated 2 years ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.ā91Updated 3 weeks ago
- Character-aware audio-only subtitlingā23Updated 3 weeks ago
- Open TTS models, built for streaming on the edgeā43Updated 2 months ago
- ā27Updated 4 months ago
- An unofficial PyTorch implementation of VALL-Eā87Updated last week
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.ā42Updated 3 years ago