neurlang / dataset
IPA Phonetic dataset lexicon
☆18 Updated this week
Alternatives and similar repositories for dataset
Users who are interested in dataset compare it to the repositories listed below
- Getting confidences from any end-to-end systems ☆11 Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database. ☆12 Updated 9 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 11 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆12 Updated 11 months ago
- IPA Phonemizer/Dephonemizer for 144 human languages ☆50 Updated last week
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Updated 2 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆12 Updated 3 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 5 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆33 Updated 2 years ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment ☆39 Updated 7 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi ☆12 Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆22 Updated 3 years ago
- This is the experimental description of MnTTS2. ☆11 Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆41 Updated last month
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition" ☆16 Updated 4 years ago
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆36 Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 Updated 11 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆21 Updated 3 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool ☆22 Updated 3 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning. ☆13 Updated 9 months ago
- Unofficial implementation of the WaveNeXt vocoder ☆53 Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast… ☆49 Updated this week