KrishnaDN / BERTphoneLinks
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
Alternatives and similar repositories for BERTphone
Users that are interested in BERTphone are comparing it to the libraries listed below
Sorting:
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 8 months ago
- A handy dataset of noises for ASR☆21Updated 6 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆31Updated 7 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last month
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Speech (audio) subjective evaluation system☆38Updated 4 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 8 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- ☆18Updated 9 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- PyTorch based speaker embedding model☆16Updated last year
- ☆12Updated 4 months ago
- A simple command line tool to calculate WER for ASR.☆14Updated 8 months ago
- ☆36Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 4 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆17Updated 11 months ago
- ☆16Updated last year
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated 2 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆14Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- ☆24Updated 3 years ago
- ☆23Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago