rorizzz / YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆17Updated 3 months ago
Alternatives and similar repositories for YOLO-Stutter
Users who are interested in YOLO-Stutter are comparing it to the libraries listed below.
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆20Updated last year
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆146Updated last week
- ☆43Updated 2 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆33Updated 3 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆104Updated 8 months ago
- ☆11Updated 2 months ago
- ☆82Updated 8 months ago
- This repository surveys papers focusing on prompting and adapters for speech processing.☆110Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated last month
- ☆83Updated 2 months ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆42Updated 2 months ago
- TODO☆39Updated last year
- Official repository of NeXt-TDNN for speaker verification☆73Updated 8 months ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 7 years ago
- ☆40Updated last month
- Reference-aware automatic speech evaluation toolkit☆155Updated 6 months ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆21Updated last year
- EMO-SUPERB submission☆44Updated 9 months ago
- A list of papers for child ASR☆42Updated 8 months ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆179Updated 11 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆38Updated last month
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆93Updated 7 months ago
- Self-supervised Speaker Diarization Interspeech 2022 Implementation☆8Updated 8 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).☆28Updated 6 months ago
- ☆61Updated 8 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆53Updated 4 months ago
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆56Updated 9 months ago
- PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆27Updated last month
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆23Updated 3 months ago