willianantunes / transcriber-wrapper
Wrapper of well-known transcribers that transform text into phoneme codes
☆15 · Updated 4 years ago
Alternatives and similar repositories for transcriber-wrapper
Users who are interested in transcriber-wrapper are comparing it to the libraries listed below.
- Streamlit app to visualize and edit TTS datasets ☆15 · Updated 4 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning. ☆46 · Updated 3 years ago
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- Gentle and praatio scripts for easy forced alignment ☆18 · Updated 3 years ago
- Karaokey is a vocal remover that automatically separates the vocals and instruments. A deep learning model based on LSTMs has been traine… ☆42 · Updated 2 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection! ☆25 · Updated 11 months ago
- Example python scripts to evaluate various ASR methods ☆11 · Updated 4 years ago
- Creates video from TTS output and viseme images. ☆16 · Updated 3 years ago
- Python library for audio augmentation ☆85 · Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆24 · Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 · Updated 4 years ago
- Finally, some decent sample sentences ☆23 · Updated 2 years ago
- Nala is an agile open-source voice assistant framework (20+ actions). ☆36 · Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] ☆26 · Updated 4 years ago
- ParallelWaveGAN adaptation for Mozilla TTS ☆15 · Updated 5 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 · Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio ☆40 · Updated last month
- Phoneme prediction from speech mel-spectrograms using RNN. ☆15 · Updated 6 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings. ☆13 · Updated 3 years ago
- Tools to create your own voice dataset for TTS training ☆70 · Updated 5 years ago
- Burn captions (.srt) into videos ☆10 · Updated 2 years ago
- Python C extension for the eSpeak speech synthesizer ☆12 · Updated 5 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments ☆43 · Updated 4 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆58 · Updated 6 years ago
- check your data, before you wreck your model ☆16 · Updated 3 years ago
- TTS Client for Coqui TTS server ☆13 · Updated 3 years ago
- Labeled data for homograph disambiguation ☆62 · Updated 2 years ago
- Voice analysis software (Python port of VoiceSauce) ☆60 · Updated 6 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 · Updated 5 years ago
- Breaks a word into syllables using an LSTM-based neural network. ☆20 · Updated 2 years ago