collabora / whisper-finetuning
Whisper finetuning
☆15Updated 9 months ago
Alternatives and similar repositories for whisper-finetuning
Users that are interested in whisper-finetuning are comparing it to the libraries listed below
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- Getting confidences from any end-to-end system☆11Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Updated 10 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Updated 10 months ago
- ☆17Updated 4 years ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆33Updated this week
- ☆75Updated 3 months ago
- ☆14Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆39Updated 8 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated 3 months ago
- ☆18Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 11 months ago
- ☆13Updated 3 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated 11 months ago
- A Weakly Supervised Forced Alignment for disfluent speech☆15Updated 2 years ago
- ☆27Updated last week
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 5 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Updated 2 years ago
- ☆29Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 3 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 3 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Updated 5 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 6 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 3 months ago
- ☆29Updated 3 weeks ago