willianantunes / transcriber-wrapper
Wrapper of well-known transcribers that transform text into phoneme codes
☆15 · Updated 4 years ago
Alternatives and similar repositories for transcriber-wrapper
Users who are interested in transcriber-wrapper compare it to the libraries listed below.
- Streamlit app to visualize and edit TTS datasets ☆15 · Updated 4 years ago
- Example python scripts to evaluate various ASR methods ☆11 · Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN. ☆15 · Updated 6 years ago
- Finally, some decent sample sentences ☆23 · Updated 2 years ago
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆24 · Updated 4 years ago
- Creates video from TTS output and viseme images. ☆15 · Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 · Updated 3 years ago
- BioVoice: a multipurpose tool for voice analysis ☆10 · Updated 5 years ago
- Simple PyTorch Denoisers for Waveform Audio ☆38 · Updated 3 weeks ago
- Python library for audio augmentation ☆85 · Updated 2 years ago
- Handling audio files in Python ☆38 · Updated this week
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261] ☆26 · Updated 4 years ago
- Convert native orthographies to the International Phonetic Alphabet ☆16 · Updated 6 months ago
- ParallelWaveGAN adaptation for Mozilla TTS ☆15 · Updated 5 years ago
- Karaokey is a vocal remover that automatically separates the vocals and instruments. A deep learning model based on LSTMs has been traine… ☆42 · Updated 2 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning. ☆46 · Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Updated 2 years ago
- Forced Alignments for Common Voice ☆32 · Updated 5 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ☆48 · Updated 2 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions). ☆35 · Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 · Updated 6 years ago
- Labeled data for homograph disambiguation ☆63 · Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 · Updated 6 years ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese ☆36 · Updated 3 years ago
- Persian Grapheme-to-Phoneme (G2P) converter ☆41 · Updated last year
- Extract frequency, power, width and dissonance of formants from wav files ☆28 · Updated 3 years ago
- TTS Client for Coqui TTS server ☆13 · Updated 3 years ago
- ☕🇧🇷 Kaldi scripts for Brazilian Portuguese ☆59 · Updated 3 years ago