Lamomal / s3prl_correlation
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆8Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for s3prl_correlation
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆16Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆25Updated last year
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- ☆12Updated 3 years ago
- ☆9Updated 4 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆28Updated 8 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- MSP-Podcast Challenge Baseline Code☆17Updated 5 months ago
- ☆10Updated 5 months ago
- ☆36Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5☆18Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆20Updated 3 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆44Updated last year
- Repo for the FB AI Speech team.☆22Updated 3 years ago
- ☆9Updated last year
- phone inventory library☆15Updated last year
- ☆31Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆11Updated 8 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆35Updated last month
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆9Updated last year