willianantunes / transcriber-wrapper
Wrapper of well-known transcribers that transform text into phoneme codes
☆15 Updated 3 years ago
Related projects
Alternatives and complementary repositories for transcriber-wrapper
- Novoic's linguistic feature extraction library ☆35 Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets ☆14 Updated 2 years ago
- Example python scripts to evaluate various ASR methods ☆12 Updated 2 years ago
- ParallelWaveGAN adaptation for Mozilla TTS ☆15 Updated 4 years ago
- Voice analysis software (Python port of VoiceSauce) ☆55 Updated 5 years ago
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible… ☆38 Updated last month
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆13 Updated last year
- Unsupervised Speech Decomposition via Triple Information Bottleneck ☆14 Updated 4 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data ☆69 Updated 3 years ago
- Convert native orthographies to the International Phonetic Alphabet ☆13 Updated 2 years ago
- Finally, some decent sample sentences ☆22 Updated 11 months ago
- Deep learning for Text to Speech ☆26 Updated 3 years ago
- Gentle and praatio scripts for easy forced alignment ☆18 Updated 2 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning. ☆41 Updated 2 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments ☆41 Updated 3 years ago
- Tools to create your own voice dataset for TTS training ☆61 Updated 4 years ago
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data. ☆51 Updated 2 months ago
- Collect Voice Conversion researches ☆90 Updated this week
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆81 Updated 6 months ago
- Python library for audio augmentation ☆83 Updated last year
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech ☆22 Updated last year
- Data processing tools for preparing speech and labels for training TTS voices ☆24 Updated 4 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ☆28 Updated 3 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- End-to-end spoken language identification out of the box. ☆48 Updated 3 years ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 2 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech. ☆83 Updated last year
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss". ☆33 Updated 3 years ago
- Mellotron singing synthesizer using CPU ☆13 Updated last year