tabahi / formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for formantfeatures
- ☆40Updated 2 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- Phoneme segmentation using pre-trained speech models☆54Updated 2 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 3 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆22Updated 9 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆37Updated last year
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 4 years ago
- ☆55Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆33Updated 10 months ago
- A Python toolbox for speech features extraction☆159Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- ☆28Updated 4 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.☆42Updated 2 months ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆79Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- Util code, issues, discussions☆28Updated 6 years ago
- Speech formant tracking code in Python☆15Updated 11 years ago
- ☆25Updated 2 years ago
- ☆59Updated 4 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 4 months ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆38Updated 3 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆53Updated last year
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆83Updated last year