LexicalStressDetection / lexical-stress-detectionLinks
Deep Learning model for lexical stress detection in spoken English
☆29Updated 5 years ago
Alternatives and similar repositories for lexical-stress-detection
Users that are interested in lexical-stress-detection are comparing it to the libraries listed below
Sorting:
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 2 years ago
- ☆25Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Goodness of Pronunciation (GOP) for oral reading assessment.☆52Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆26Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0☆46Updated 3 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆60Updated 4 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆96Updated 2 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80Updated 2 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆26Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- ☆40Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- ☆39Updated 9 months ago