rorizzz / YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆12Updated 3 months ago
Alternatives and similar repositories for YOLO-Stutter:
Users that are interested in YOLO-Stutter are comparing it to the libraries listed below
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 10 months ago
- ☆43Updated last year
- Official repository of NeXt-TDNN for speaker verification☆65Updated 3 months ago
- ☆63Updated 4 months ago
- ☆21Updated 7 months ago
- ConMamba for Automatic Speech Recognition☆54Updated 5 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆48Updated this week
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆53Updated 7 months ago
- TODO☆37Updated last year
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆100Updated 3 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆21Updated last month
- Self-supervised Speaker Diarization Interspeech 2022 Implementation☆9Updated 3 months ago
- ☆19Updated last year
- ☆48Updated 2 months ago
- ☆21Updated 5 months ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆47Updated last year
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆54Updated 4 months ago
- ☆56Updated 3 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆84Updated 2 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆112Updated 2 weeks ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 7 months ago
- ☆55Updated 8 months ago
- ☆30Updated last year
- ☆76Updated 5 months ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆49Updated 3 weeks ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆39Updated last year