EricWilbanks / faseAlign
Command line tool for forced-alignment of Spanish speech data
☆13 Updated 2 years ago
Alternatives and similar repositories for faseAlign
Users that are interested in faseAlign are comparing it to the libraries listed below
- ☆40 Updated 3 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 Updated last year
- Alignment files of LibriTTS. ☆64 Updated 5 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆44 Updated 2 years ago
- Implementation of audio degradation processes ☆103 Updated 9 years ago
- PyTorch implementation of RPNSD ☆60 Updated last year
- Simple Python package for fast DER computation ☆33 Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator ☆45 Updated 6 months ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection" ☆24 Updated 5 years ago
- ☆27 Updated 4 years ago
- ☆56 Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆67 Updated 2 months ago
- A set of Matlab code for carrying out glottal source and voice quality analysis ☆34 Updated 12 years ago
- Discriminative Training of VBx Diarization ☆25 Updated 10 months ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆23 Updated 8 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- ☆36 Updated 4 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆60 Updated 3 years ago
- Yin pitch estimator in PyTorch ☆114 Updated 2 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆30 Updated 2 years ago
- ☆54 Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations ☆113 Updated 9 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆43 Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ☆40 Updated 4 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 Updated last year
- ☆18 Updated 3 years ago
- Phonetically-Oriented Word Error Rate ☆35 Updated 6 years ago
- ☆98 Updated 2 years ago
- A simple package for Guided source separation (GSS) ☆127 Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆77 Updated 2 years ago