collabora / whisper-finetuningLinks
Whisper finetuning
☆14Updated 6 months ago
Alternatives and similar repositories for whisper-finetuning
Users that are interested in whisper-finetuning are comparing it to the libraries listed below
Sorting:
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆10Updated 6 months ago
- A Weakly Supervised Forced Alignment for disluent speech☆14Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- ☆17Updated 4 years ago
- ☆19Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆30Updated 4 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆25Updated 10 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated 8 months ago
- ☆25Updated last year
- Swarah: Indian-English speech dataset collected across the country☆36Updated 3 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆15Updated 10 months ago
- ☆13Updated this week
- ☆28Updated 4 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Updated 3 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 8 months ago
- ☆14Updated 10 months ago
- Transfer learning approach to pronunciation scoring☆11Updated last year
- SubER - Subtitle Edit Rate☆23Updated last month
- ☆16Updated 2 years ago
- ☆12Updated last week
- ☆61Updated this week
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated 11 months ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆11Updated last week
- An upgrade framework for train and validate compare with icefall using Lightning.☆13Updated 6 months ago
- One command to start a streaming ASR server.☆12Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 8 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 7 months ago
- E2E ASR system☆14Updated 2 years ago