MasonPhonLab / MAPS
Mason-Alberta Phonetic Segmenter
☆9Updated 2 months ago
Alternatives and similar repositories for MAPS:
Users that are interested in MAPS are comparing it to the libraries listed below
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆12Updated 4 years ago
- ☆9Updated 5 years ago
- An extension of PHOIBLE that includes features for allophones.☆10Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- ☆16Updated 5 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 6 months ago
- Simple Kaldi recipe for forced alignment☆10Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 6 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- ☆40Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆19Updated last year
- ☆16Updated 6 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- ☆12Updated 2 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 2 months ago
- ☆11Updated 2 years ago
- A handy dataset of noises for ASR☆21Updated 5 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆16Updated 2 weeks ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆14Updated last year
- Easier analysis of large speech corpora☆22Updated 3 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆17Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- ☆11Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- ☆10Updated 6 months ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆27Updated last year