Lamomal / s3prl_correlation

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

☆8

Related projects ⓘ

Alternatives and complementary repositories for s3prl_correlation

skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Updated 2 years ago
SpeechClub / CDER_Metric
CDER (Conversational Diarization Error Rate) Scoring Tool
☆16Updated 2 years ago
X-LANCE / BER
Balanced Error Rate for Speaker Diarization
☆25Updated last year
wngh1187 / IPET
Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS
☆10Updated last year
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12Updated 3 years ago
BUTSpeechFIT / OOV-recovery-in-hybrid-ASR-system
☆9Updated 4 years ago
facebookresearch / MMCSG
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆28Updated 8 months ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆12Updated 3 years ago
msplabresearch / MSP-Podcast_Challenge
MSP-Podcast Challenge Baseline Code
☆17Updated 5 months ago
huangruizhe / ConEC
☆10Updated 5 months ago
Hertin / WavPrompt
☆36Updated 2 years ago
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆18Updated last year
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆23Updated 8 months ago
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17Updated 2 years ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9Updated 2 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 2 years ago
mct10 / CoBERT
Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning
☆46Updated last year
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆20Updated 3 months ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆13Updated last month
archiki / ASR-Accent-Analysis
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆13Updated 4 years ago
vectominist / spin
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆44Updated last year
facebookresearch / fbai-speech
Repo for the FB AI Speech team.
☆22Updated 3 years ago
minkjung / blankcollapse
☆9Updated last year
xinjli / phonepiece
phone inventory library
☆15Updated last year
JSALT-2022-SSL / superb-prosody
☆31Updated last year
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated 3 months ago
Speech-Lab-IITM / data2vec-aqc
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆11Updated 8 months ago
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Updated last year
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆35Updated last month
bagustris / ssl-ser
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆9Updated last year