Lamomal / s3prl_correlation
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆8 · Updated 2 years ago
Related projects:
- Balanced Error Rate for Speaker Diarization ☆25 · Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆12 · Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool ☆15 · Updated 2 years ago
- ☆30 · Updated last year
- phone inventory library ☆14 · Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses … ☆12 · Updated 8 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆11 · Updated last month
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini… ☆11 · Updated 6 months ago
- ☆35 · Updated 2 years ago
- Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 ☆19 · Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin… ☆41 · Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆46 · Updated 10 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models ☆24 · Updated 9 months ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆14 · Updated last year
- wake-up word emotion recognition [APSIPA 2022] ☆17 · Updated last year
- A CSRankings-like index for speech researchers ☆30 · Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M… ☆17 · Updated last year
- A probabilistic scoring backend for length-normalized embeddings. ☆10 · Updated 4 months ago
- ☆9 · Updated last year
- ☆13 · Updated last year
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one … ☆26 · Updated 6 months ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ☆20 · Updated last year
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS ☆10 · Updated last year
- Script to perform statistical significance test between ASR hypotheses. ☆19 · Updated 7 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆15 · Updated 6 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆23 · Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024) ☆21 · Updated 6 months ago
- ☆12 · Updated 6 months ago
- MSP-Podcast Challenge Baseline Code ☆12 · Updated 3 months ago