CherokeeLanguage / cherokee-audio-data
Cherokee Audio data
☆10Updated last year
Alternatives and similar repositories for cherokee-audio-data
Users that are interested in cherokee-audio-data are comparing it to the libraries listed below
Sorting:
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ☆36Updated 10 months ago
- ☆80Updated 11 months ago
- Heteronym to Phoneme Parser☆18Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆33Updated this week
- phone inventory library☆16Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆14Updated 2 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆14Updated 2 years ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- Proposed splits for the LREC Wikipron paper☆14Updated 5 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- A JAX library for building lattice-based speech transducer models☆45Updated 5 months ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆18Updated 2 years ago
- Minimalist Speech-to-Text toolkit for educational purposes☆12Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 8 months ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature…☆11Updated 7 months ago
- Workflow for forced alignment between languages☆18Updated last year
- ☆19Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- ☆11Updated 3 weeks ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆34Updated 3 years ago
- asr2k☆50Updated 11 months ago
- Project of Singing Voice Conversion.☆14Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆22Updated 2 months ago
- Production-ready vocoder using BigVSAN☆11Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆22Updated last year
- A guide to building language technology in new languages.☆58Updated 3 years ago