amritkromana / disfluency_detection_from_audio
☆21Updated 8 months ago
Alternatives and similar repositories for disfluency_detection_from_audio
Users that are interested in disfluency_detection_from_audio are comparing it to the libraries listed below
Sorting:
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆51Updated 9 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆53Updated 3 weeks ago
- ☆51Updated 6 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆90Updated 5 months ago
- multilingual speech aligner☆74Updated last year
- ☆43Updated 2 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆52Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆32Updated 7 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆36Updated last year
- ☆38Updated 7 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last month
- ☆64Updated 3 weeks ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- Official implementation of MelHuBERT☆65Updated 6 months ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆26Updated last week
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆66Updated 8 months ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆21Updated last year
- A toolkit dedicate for speech evaluation.☆19Updated 7 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 3 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆32Updated 2 weeks ago
- A list of papers for child ASR☆40Updated 7 months ago
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition☆10Updated 8 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 11 months ago
- MSP-Podcast Challenge Baseline Code☆22Updated 11 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆22Updated 2 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- ☆29Updated last year