shreyas253 / SylNet
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆25Updated last year
Alternatives and similar repositories for SylNet:
Users that are interested in SylNet are comparing it to the libraries listed below
- ☆40Updated 2 years ago
- Workflow for forced alignment between languages☆17Updated 11 months ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- Collection of scripts and utilities for reorganizing corpora to use with the Montreal Forced Aligner☆44Updated 3 years ago
- Constrained Permutation Invariant Training, Speech Separation☆44Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆34Updated 6 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆31Updated last year
- Dynamic time warping (DTW) functions for specifically speech alignment.☆28Updated 8 months ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- ☆32Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- ☆17Updated last year
- ☆12Updated 3 years ago
- ☆12Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated last year
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- ☆22Updated 3 years ago
- Easier analysis of large speech corpora☆22Updated 3 years ago
- Code for AccentDB.☆19Updated 3 years ago