neurlang / dataset
IPA Phonetic dataset lexicon
☆18 Updated this week
Alternatives and similar repositories for dataset
Users that are interested in dataset are comparing it to the libraries listed below
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- IPA Phonemizer/Dephonemizer for 144 human languages ☆50 Updated last week
- Pybind11 bindings for Kaldi ☆15 Updated 3 months ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆33 Updated 2 months ago
- Simple Kaldi recipe for forced alignment ☆11 Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database. ☆12 Updated 9 months ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 Updated 7 years ago
- Getting confidences from any end-to-end system ☆11 Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆33 Updated 2 years ago
- High quality text-to-speech based on StyleTTS 2. ☆71 Updated 3 weeks ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 5 months ago
- A handy dataset of noises for ASR ☆22 Updated 6 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆41 Updated last month
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆47 Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆12 Updated 11 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi ☆12 Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆18 Updated 9 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Updated 3 years ago
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Updated 2 months ago
- A simple command line tool to calculate WER for ASR. ☆14 Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool ☆22 Updated 3 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 Updated 9 months ago
- Unofficial implementation of the WaveNeXt vocoder ☆53 Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 Updated last year
- Online streaming speaker change detection model in PyTorch ☆43 Updated 2 years ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast… ☆49 Updated this week
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 11 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 Updated last year