neurlang / datasetLinks
IPA Phonetic dataset lexicon
☆18Updated 3 weeks ago
Alternatives and similar repositories for dataset
Users that are interested in dataset are comparing it to the libraries listed below
Sorting:
- IPA Phonemizer/Dephonemizer for 140 human languages☆54Updated last month
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Updated 10 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆34Updated 2 years ago
- Pybind11 bindings for Kaldi☆15Updated last week
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- The EveryVoice TTS Toolkit - Text To Speech for your language☆42Updated last week
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆51Updated this week
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated 3 months ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 4 months ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- ☆32Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Unofficial implementation of wavenext vocoder☆57Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- ☆18Updated 2 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 4 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Updated 9 months ago
- Colab notebooks for Next-gen Kaldi☆29Updated 3 months ago
- Whisper finetuning☆15Updated 10 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 6 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 7 months ago
- Crowdsourced and Automatic Speech Prominence Estimation☆24Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆34Updated last month
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated 3 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated last year