collabora / whisper-finetuningLinks
Whisper finetuning
☆14Updated 3 months ago
Alternatives and similar repositories for whisper-finetuning
Users that are interested in whisper-finetuning are comparing it to the libraries listed below
Sorting:
- ☆22Updated 11 months ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆25Updated 2 months ago
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆11Updated 7 months ago
- ☆17Updated 4 years ago
- ☆19Updated last year
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 2 years ago
- ☆12Updated 7 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆22Updated 8 months ago
- ☆24Updated last month
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated 2 months ago
- ☆12Updated 9 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆29Updated 2 months ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆14Updated 8 months ago
- ☆18Updated 3 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆9Updated 9 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 5 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆18Updated 2 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆9Updated 8 months ago
- ☆15Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆13Updated 4 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Updated 5 months ago
- ☆16Updated last month
- A Weakly Supervised Forced Alignment for disluent speech☆14Updated last year
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆13Updated 8 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆68Updated 3 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated last year