collabora / whisper-finetuningLinks
Whisper finetuning
☆15Updated 8 months ago
Alternatives and similar repositories for whisper-finetuning
Users that are interested in whisper-finetuning are comparing it to the libraries listed below
Sorting:
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Updated 9 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Updated last year
- ☆17Updated 4 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆13Updated 9 months ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆39Updated 7 months ago
- ☆14Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- ☆15Updated last week
- ☆16Updated 8 months ago
- Swarah: Indian-English speech dataset collected across the country☆37Updated 5 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated 2 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆29Updated last year
- ☆74Updated 2 months ago
- ☆13Updated 4 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 2 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated 2 years ago
- Pybind11 bindings for Kaldi☆15Updated 3 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- ☆18Updated last year
- Text-to-Speech Latency Benchmark☆22Updated 6 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 7 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆31Updated last month
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆19Updated last year
- ☆17Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Updated 6 months ago