rorizzz / YOLO-StutterLinks
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆17Updated 3 months ago
Alternatives and similar repositories for YOLO-Stutter
Users that are interested in YOLO-Stutter are comparing it to the libraries listed below
Sorting:
- Layer-wise analysis of self-supervised pre-trained speech representations☆103Updated 7 months ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 7 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆19Updated last year
- ☆43Updated 2 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆30Updated 3 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆91Updated 6 months ago
- Official repository of NeXt-TDNN for speaker verification☆71Updated 7 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆96Updated 2 years ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆27Updated 5 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆56Updated 11 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆37Updated 3 weeks ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆38Updated last week
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated last month
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated last year
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆108Updated last year
- ☆9Updated last month
- ☆54Updated 7 months ago
- ☆19Updated 2 years ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆145Updated last month
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆83Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 5 months ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆23Updated last week
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆21Updated last year
- ☆30Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆26Updated last year
- INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"☆115Updated last year
- ☆56Updated last year