audeering / w2v2-age-gender-how-to
How to use our public wav2vec2 age and gender model
☆34 · Updated last year
Alternatives and similar repositories for w2v2-age-gender-how-to:
Users who are interested in w2v2-age-gender-how-to are comparing it to the repositories listed below.
- multilingual speech aligner ☆73 · Updated last year
- ☆48 · Updated 2 months ago
- Objective metrics used in several text-to-speech (TTS) papers. ☆46 · Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆82 · Updated 2 years ago
- ☆44 · Updated last year
- ConMamba for Automatic Speech Recognition ☆53 · Updated 5 months ago
- ☆62 · Updated 4 months ago
- Clustering-based methods for overlapping diarization ☆74 · Updated last year
- ☆65 · Updated last year
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆25 · Updated 4 months ago
- Streaming Audio Transformers for online audio tagging ☆43 · Updated 7 months ago
- Unofficial PyTorch reproduction of the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (… ☆59 · Updated 9 months ago
- ☆30 · Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit… ☆78 · Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆73 · Updated last year
- ☆63 · Updated last year
- The official source code of UniAudio ☆85 · Updated 9 months ago
- Speech samples and code of BEdit-TTS ☆32 · Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆51 · Updated 2 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆84 · Updated last month
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆13 · Updated 2 years ago
- ☆64 · Updated last year
- ☆27 · Updated last year
- Alignment files of LibriTTS. ☆60 · Updated 4 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P… ☆35 · Updated last year
- ☆23 · Updated 7 months ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess… ☆50 · Updated last month
- A list of papers for child ASR ☆35 · Updated 3 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆36 · Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System". ☆20 · Updated 3 months ago